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Abstract 

Thii reportpresentt > r»v«l n«we^-Ulw r i pw it n^w fy fffPraaaUon, called 
"constraint expressions* (CE). CE make! use of some of the knowledge-representation 
techniques developed by Artificial Intelligence retearch. A CE network consists of points 
(which represent classes of objects) interconnected bv constraints (which represent the 
relationships which are krrowa * fcofcj amee*Hi* eJesfet). AS constraints are defined in 
terms of six primitive ones. The dau in a CE network is accessed by propagating various 
kinds of labels through it: Each constraint can be viewed at an active process which looks 
for certain patterns of labels on some of Its attached points, and then propagates new labels 
to other points when such patterns oeew. 

The CE representation provides several significant features which are not found 
in most current data models, first, the same mechanism is used to represent "general" as 
well as "specific* information. For example, "The sex of jane Smith U female" is specific, 
while "Every person has a unique sex which is either 'male* or ^female* is general. 

Second, CEViabeVpropagation procedure implements logical consistency checking: 
Data-base integrity can be maintained by checking all new data for consistency with the 
existing information. Since the data-base can contain general information (representing a 
"semantic model* of the data-base's application domain), new specific data can be rejected if 
it is inconsistent with either other specific data or wtth the general information. Also, the 
general mformaikm can itself be diecked for internal cohfueency. 

Third, the CE representation is sufficiently modular and well-defined so that it 
has a precise format wwatmes, whid* insures that CE's definition contains no hidden 
ambiguities or contradictions. ■ 

Fourth, CEs modularity allows the label propagations to be done in parallel, so 
that paraHet hardware can be used to f u»l advantage. 
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7 Introduction 

Induction 

The work reported in this doctit^tpo^^m ^ m»m^'\^r^^iv\t^on for 
information called "conuralnt expreuions" (CE). Thij reprewnttUoo has two major 
features which distinguish it from most others. The fint is ihat it deals comfortably with 
incomplete information. For example, unliie many oth«r fepresenutiom, A^datar^base 
structured, in terms of f& can easily contain in/orrf^Uon abo# ; the clau of >^ persons" 
even when the data-base does not contain a complete list of all of them. This feature 
allows the data-base to contain both "specJfic'jnformaUon (such as "Mary Smith is the 
mother of Jane Smith") and "general" informaUon (such **,*f*ch pjfpo^ has a unique 
mother"). 

The second major feature is that tMjPl^r^resp||^^ ha^ a well-defined logical 
semantics, which precisely defines the meaning. of every p|eoi of f any given CE ne^ork. 
Many other representations lack an adequate logical semantics^ which makes Jdjffjcujt to 
understand them in a coherent way and hinders one from cpmpar,ing their relative 

strengths and weaknesses. In addition^CEyserftanHcj '•» - "jp^Tf^iq^lH^^Cr ■* t *-'? , ?5 l f?Ai ,, * E -~ , ^PC**?^^ , in 
that it specifies not only what any given expression means, but also how, |o compute 
inferences from it More specifically, the logical and procedural semantics are specified in 
terms of how various kinds of labels" are allowed to propagate through a CE network. 
The information in a CE data-base is contained in the structure of such a network, and 
this information is then accessed by moving labels through the net. This kind of semantics 
encourages one to think of a CE data-base as operating in a highly parallel manner, with 
each datum acting as an active process which propagates labels. 

This document is divided into three main parts. The first gives the technical 
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details of the CE representation, using examples relating to a hypothetical data-ba* of 
census information: The «ctmd pirt is more phihiiophlca! thwi technical - it compares 
CE with other representations, diwusses why repmemtttooal fssues are important in the 
first ptace. and examines «oirw ar*ai in which the CE represeatttton te not adequate. The 
third and final part is a coBeetion of technical ap pen dio s s. " The toplti include such things 
as parallel hardware and "Wwkdg* about knowledge' (such as ^BiHy knows who jane 
Smith's real father is, and she doesn't know that ht knows.") These appendices may be read 
in any order far skipped entirety) since each U targety self contained. 

The reader should be forewarned that much of pan one is not particularly easy 
reading - the presentation is organized to minimize die number etf forward references, 
which means that interesting examples occur only after the necessary machinery has been 
introduced. The main reason for this rather dry "bottom up" format ft brevity ~ it would 
be possible to discuss the "%% picture* m parallel wim the debits, but at a cost of perhaps 
doubling the jiie of this document, ft ii expected that the JJiiiMrtiid reader win skim the 
entire document first (especially part Swo) hi order to get enough off the big picture to 
motivate studying the detam 



Part One - Technical Details 

The following three sections present the technical detail* of the CE 
representation. Section I wtroduces the abstract universe of objects, daises, and constraints 
in which CE operates, and discusses how information retrieval and inference are 
accomplished. Section 2 present* the six different primitive coapjtraints which are used (in 
the current formulation of CE) to structure this univers*. Section 3 then uses these 
primitives to construct more complex constraints, such as those involving boolean functions, 
transitive relatione and naive probability. 

1 TheCE universe 
1,1 Objects and Classes 

The CE universe is composed of atomic pbjec|s. which can be aggregated to form 
arbitrary classes. Each particular object either is or is not contained in any given class - it 
is impossible for some, object to both be in arid not be in the same class. Of course, an 
object may be in more than one class, and a class nuy have any number of objects in it 
(from zero to infinitely many), Section 23 gives mow details about how ob-jects and classes 
interact 

As a convenient notation, , tet uppercase names denote cjasfes, and let other names 
denote objects, These naroes may contain hyphens and other punctuation. Sometimes a 
name will be enclosed in single quotes to avoid confusing it with the surrounding text. For 
example. REGISTERED-VOTERS and OCCUPATIONS named****, while 'Jane-Smith' 
and lawyer' name objects. 
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1.2 Constraints 

As presented &PT&, the uhlferseihai no strudtittV-iftf Arbitrary assignment of 
objects to classes is allowed. Xanltraihts adtf the wWtsarr 'ttructare by constraining the 
allowable assignments, *or rtfcmpte, the ce^a^^"atf^e8«tt^E^ 
PERSONS" requires that every dbjtet assigned tb REfclS^ERlD^VOTtiRS abb be 
assigned to PERSONS. Settteri * defines iflHhe primitive Cl'o^r^ 
how certain patterns ©Y isbjfecfr as^nmehis (for exiniptt/*^^ 'jiihS^inith' to 
REGISTERED-VOTERS) an force other assipiifti^ i <s^1ft f al^ to 

PERSONS). These definitions provide the logical semantics for each primitive. 

The information in a CE data-base is r«pre»ent«d a* a netwbf k of such 
constraints connected to the appropriate classes. Both "general" i^^fife'ic^iiifornuition 
can be represented- ^^'Wkk^^-im- ^i^^'m^WW^t tftta-base schemes is 
discussed in part ttta' ■ &**&&&, tKediuei wtf! be draiW* ^*poM #in#ne^o¥k, ind 
the constraint* ^fe'tfftUTaiSiU'^^ The 

class^poiriSs '%*}%* *zrh§& Wriffy 1 i*ti-b4-tet&t*¥&^tlhfy WlUe text 
(e.g. *VGmm&^V&¥m$^il&to!*s th^nieJveriil^i^^yr'aW'data'base 
and do not carry any data-base information. The "meaning" of a dais-point 'Jucn as 
RECISTEREt^VdT^^fer 8 ^^^^^!^ nriterm* <rf rts connecttortt P^teh ttte rest of the 
network, and norat *^ierm^ «f whateiilf «tmuf ratme theciaii^pomt hasW arTy) 

. .;■ : ;-v. srtJ diu ; ; ;i ^rsmslrso:* ;>■' - ,. • ;.;/a.'0 &■■■;-. ■.; ■% im:3br+ ^ \l : .* v " '>•'• 

L3 Extension and Intern**' *<<o' >*■•*.> ..-. ~. ;: - "ii.i fc^ .- " ,• ' •■• • 

Within the abstract universe, the important aip^elr-of't cWi$^uth as 
'REGISTERED-VOTERS' is its extension (the objects which it contains, in this case 
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presumably all the registered voters). Within the data-base however, the important aspect is 
it* intension (how it is constrained to relate to other classes). In some cases the intension 
and the extension coincide - a class in the data-base can be constrained (defined) in terms 
of an explicit listing of its objects. For example, the class BJkQpD-GROUPS can be 
defined by listing the four blood i groups V, V, >b', and V. 

However, in many cases the data-base will not contain such complete information 
about a class (i.e. it* extension). For example, consider JAN^M^H^S-Bl.QOD-CROUP, 
which is a one-object class conttlning an object such as W. If the data-base does not 
kjrtow Jane Smith's blood group, then it does not know thje ; exj^is^>n of the class. But, it 
does know some things about the class in terms of its intension (how It relates to the rest of 
the network). For one thing, the class can be constrained to be a subclass of the BLOOD- 
CROUPS class. For another, a person's blood group can be constrained in terms of their 
parents' blood groups. The important thing to remember is that what the data-base 
"knows" (via intensions) may only be a small part of what is true in the universe (via 
extensions). (Note that the meanings of the terms "extension" and "intension" as used in 
this document are not the same as their meanings in modern mathematical logic The 
meanings used here are similar to those used by Pople 09721) 

1.4 Inference 

The reason for being concerned With extensions in |he ; first place is tjiat they 
provide the foundations for the logical semantics of the conttraints {as described in section 
2), which in turn provide the framework for making inferences from the data-base. Here, 
"inference" means the process of accessing the data-base in order to acquire information. 
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The kinds of inference cdmiideSred here ire: 

(1) Retrieval ~ the user aski questions of the data-base. 

(2) Consistency - the user adds new information to the data-base, and wants to be notified 
if the new data confUctswirti ixlsung 1 «Jata. - ,y 

(3) Redundancy - the user adds rievTdata and 'warrts lo be notified if the new data is 
redundant (ie.'fc implied by existing data). ^ 

This document focuses on the issue 6f consistency for a couple of reasons." One is 
that maintaining; a consistent data-base fn the *reai wdrtcT'is mm a Very difficult problem, 
and CE provides one p«Jrt^ solution. TTie other reason U that all thro* forms 
oi inferenoe esui t>« subsunteol tutelar oi<MMti«tMoio- okeokint;. 
For redundancy checking, a new datum is redundant iff its negation is inconsistent with the 
existing data-baie. Tor retrieval, a "yes/no"' question such as "'16 'jane Smith's father's blood 
group the same as W rnbtifier'Jif" can ibe answeied^''cne^^ ( ^'1bs«r^ that *j»ne 
Smith s father* blood group is the tame as her mothers for both consistency and 
redundancy: If it is ineonsiaent, then ithe answer it WTtf wt Awwaw t then the answer is 
"yes"; if neither then ''don't know^Stece che&mg fw rtdunda*^ means checking the 
negation for inconsUtency.theri anawet to the above ye^rno J qlwsww'ii" ,, ^s* #f the 
assertion that "jane Smith's father's blood group is mw the same as her mother's" is 
inconsistent with the existing data-bast Retrieval for "find" questions such as Tind Jane 
Smith's hair cooler Tr^ ev^ but involves 

added complications -ffi$^$jMtiai ^append^A. ' ' " '- < ' ;:) ' ::: > ' '■'■'' ™ ' 
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1.4.1 Labels and Propagation 

The consistency-checking inferences are performed by propagating labels through 

the CE data-base network. A label is an extensional device which relates an object to a 

class in the network. A label names an object and is "on" a class - the label can actually be 

written on a CE network diagram next to a class-point (which "puts the label on the class"). 

A class may have more than one label on it, and the same label may be used on more than 

one class. Here are the three kinds of labels Cobj' is some object): 

♦obj The class contains 'obj'. 

-ob j The class does not contain 'obj'. 

■obj The class contains exactly the one object 'obj' and nothing else. 

Note that -obj is a special case of ♦obj, so anything said be low about V labels applies 
equally well to '•' labels. 

Labels propagate through the network because each constraint looks for certain 
patterns of labels on the class-points to which it is attached, and then creates other labels 
when such a pattern occurs. All of section 2 consuls of specifying these patterns for the 
primitive CE constraints. 

1.4.2 Label Collisions 

Two labels can "collide" at a common class point in two interesting ways. The 
first is that an inconsistency is detected whenever the same 
class-point gets labeled with both -obj and ♦obj for some 
object - it is impossible that an object both be in and not be in the same class. 

The second kind of collision occurs when a ♦objl collides with an -obj2. In this 
case, *objl' and 'obj2* have to be the same object. This is because the class 
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does contain 'objl* (from the «©bjl label), and it contains only *&>$ (from the -bbj2 label). 
Since *objT and ^^y^tn^o^Mean^en^ each other 

during the inference; This ai^ to the other, 

for example, a ♦objl ail ! '^^tc^^m'^Mt^u^i'im inconsistency (as above). In 
general, any class labeled with either Wjkt will be ImpMif labeled with the other, and 
these implicitly created* 'label may propagate b the usual manner. 

1.43 Initial labelings 

To check' the consistency of an assertion against the 'laa&fease. the assertion is 
represented as an initial pattern of labefc in the "nkwor*". ^1r^ : tf^l^^en ; propagated, 
and an inconsistency is indicated if (for some object Y) <* and m Sf'W> lib^ls''"eoiiid« 4 at 

the same class-point. If the data-base (without the new assertion) is consistent, then the 

, ■ ■ : •■■■.'■■'.■ .^r::'w vis/i: :.-;:: i .barf :■■*#« ills ii:si,*-;w -"M ?.<aiOf ;■&£*{;> srij nu <in*sJ€J 'to zun:- :■?•:■. 

inconsistency must have arisen either because the new assertion is Mentally inconsistent or 
because the assertion is inconsistent with the rest of the data-base (which is the interesting 
case). 

This paragraph introduces two important initial labeling patterns (which are used 
throughout the rest of this document). Let V be a new object (one that does not already 
exist in the data-base), and fet^t* and $' be two class-points h the net. 

(1) If labeling A wait *x and » with *x yields an inconsistency, then it means that there is 

■' ;;; '"- .•■,'■'• ''-jifi^ &)A*'i Lcso"" a*od.. i;s.?av^ bal^f^l ?^»?m t.ai^^"*:;^/;^ 
no such x . That is, ft is inconsistent to assert that A and a have some object in common. 

This demonstrates that A awl Bare mutuary exclusive chisies. r 

(2) Similarly, if *x on A and *x on B yields an inconsistency, it means that there u no object 

which is in A but hot toX '"'" v> ~'"' 
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2 Primitive Constraints 

This section discusses the six primitive constraint expression! used in the current 
formulation of the C£ representation. . 

A CE data-base consists of. a set of class-points that are interconnected by a 
network of these primitive constraints. Since alt inferences involving the data-base are 
performed by propagating labels through the network, a primitive constraint's meaning and 
behavior can be completely specified in terms of the label patterns it responds to and the 
labels it propagates on the basis of such patterns. Thus it is easy to add a new kind of 
primitive constraint without having to worry about possible interactions with previously 
existing primitives. 

In this section, each primitive is described by giving its intuitive meaning, its 
network symbol, the propagation rules, and some examples. More examples occur in section 
3. The more philosophical issues concerning wJut the primiu\yes "really" mean and how 
they compare with those of other representations are discussed in part two. 

2.1 The Partition Constraint 

This constraint represents the partitioning of a superclass into exclusive and 
exhaustive subclasses. The network symbol for this is shown in fjgure 2-la - the superclass 
(here, 'A*) is drawn on the convex side of the bar. and the subclasses (here, 'B' and *C') are 
drawn on the concave side. Note that there i^j^.n^.j(o/: i |e{jf^ i ,0iitn two subclauses, 
and that the particular left-to-rjght ordering of the subclas>e* U unimportant. In figure 
2-Ia, the class 'A* is partitioned into *B' and C\ This means ^tevery object in ,'A' U in 
exactly one of 'B' or "Cand that no "extra" objects (those not in 'A') are in 'B' or *C\ 
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For a more concrete example, figure 2-lb illustrates the partition of PERSONS 
into FEMALES and MALES. As an example of a partition with more than two subclasses, 
figure 2-lc partitions FEDERAL-EMPLOYEES into LEG^ATIVE-EMPLOYEES, 
EXECUTIVE-EMPLOYEES, and jUDlClAL^EMPLdYEES: Figure 2-id says that the 
class of RECISTERED-VOTERS is a subclass of PERSONS - the unnamed class-point is 
what is left over (i.e. persons who are not registered voters). Figure 2-le says that no 
registered voters are convicted felons (without bothering to nfme the superclass). 

Note that any of the classes may be empty: Figure 2-1 does nor say that there are 
any registered voter*. In fact, aft the classes in figure M could be empty (since the empty 
class can be partitioned into empty subclasses). However, if there are any REGISTERED- 
VOTERS, then they are constrained to be PERSONS a*& constrained not to be 
CONVICTED-F1LONS. f he notion of subdau such ai in f# occur* frequently enough 
to deserve a simpler symbol: Figure 24f uses m symbol, whkh should be interpreted as 
an abbreviation for the one used in 2*L Similarly, the constraint of ffcureS-lg (saying 
that 'A* is partitioned into exactly 'B' - they are the same class) hat its own symbol, shown 
in2-lh. ' '" : -" ■" ■■"••■' r <'- 

The five label propagation rules for the partition constraint are diagrammed in 
figure 2-2. Each of these rules deserves a case in which enough information is available 
(in terms of existing labels on class-points) to enable new 'labels U be propagated to other 
class-points; these new labels may then in turn enable fuirtner propagations! The left-hind 
side of each rule gives the relpvtht pattern & existing tabes, and ft* ttgbt-hand side gives 
the new propagated labels, the tttfpsei (V) indicate that there maybe more subclasses 
than are explicitly drawn, fn the figure, V is used as the name of the label object - the 
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five rules of course apply to any object. They are; , 

(pi) If an object is in one of the subclasses, then it must be in the superclass and can not 

be in any of the other mutually exclusive subclasses. In other words, the existing *x label 

provides the information that Vis in one of the subclasses. Thu information is sufficient 

to deduce that V roust be in the superclass and not in any of the other subclauses. This 

deduced information is then represented in terms of rtew labels which are put on the 

relevant classes. It is in this manner that label* "propagate." 

(p2) If an object is not in the superclass, then it can not be in any of the subclasses. This 

is a consequence of (pi) - if the object were in any of the subclaues. then U would have to 

be in the superclass, which is false. 

(p3) If an object is in the superclass, and is not in all-but-one of the subclasses, then it 

must be in the remaining subclass. This is true because the part^ion is exhaustive. 

(p4) If an object is not in any of the subclasses^ then it can not be in the superclass. (This 

too is true because of exhaustivenew). 

(p5) If the superclass contains exactly one object, and the object is not in all-but-otve of the 

subclasses, then the remaining subclass must conjoin exactly that object 

Note from figure 2-2 that (pi) and (p3) are in some, Jfise duals of each other, as 
are (p2) and (p4). Rather surprisingly, section 31 below sbow^ that this one constraint 
suffices for representing all of the Boolean functions (set-theoretic union, intersection, etc.) 

2.2 The Object Constraint 

This constraint forces a class to contain exactly onj^ject. The notation for this 
is to draw the constrained class-point as a small square (instead of a jauod point). s Since 
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such a class represents exactly one object, it can be nanwd with the lowercase name which is 
the name of the object 

For example, figure t-3a states that the president Of the U3 is an executive- 
branch employee. Figure 2-Sb states that there are exactlf four BLOOD-GROUPS, and 
lists them - this is an example where the database hts complete (extensiaial) information 
about a class. In 12-3a, the data-base certainty does -net have complete infor ma tion about 
EXECUTIVE-EMPLOYEES -all iftnowj is that fine of fiiim%*ist be ^e^residerft^of- 
the-US*. 

The pahitioh cem^raHnt is used in2-3b in order to force' the four Wood groups to 
be distinct objects: In CE, the fact that two objects are distinct most be indicated 
intenstonaHy (i.e. via some constraims in the data-baie), instead «f just assuming that two 
objects with different names ir# eWFerent (extenjhjnal) objews. Another instance of this 
distinction (duer to Frege) is that "the morning *«tat* and ^§iiiiiMi^^^^^^ ( i»me 
extensional object (the planet Venus) even though they art Trtmed : differently. Indeed, in 
some ' cases' irmay%rd¥W^^^ nW'^afeJectt are' the 

same Note that CS dc« not require an explicit panitlcm cOrtstramt between each pair of 
distinct objects (which would be atfiti wasteful) ■& alf that ft reptred%'86 ^ able to infer 
this distinctness (by, for example, starting a i *x libel on both the objeas and deriving an 
inconsistency). . ; n.: 

The propagation rule for objects is very simple: 
(ol) Each object class "broadcasts" an -obj label naming itself. * 

For example, the object cfals^b' m^Sfr starts an *4b label from Itsetf. The label says that 
the class contains exactly thl bhe^sbjec^^whicli is ofic©uW 5 p^$i^ forted bf the 
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object ce^istrainv The «obJ labels, so generated can *nen intera^. wjth other cautraipts to 
propagate further. For example, in ?-3b the r *$ p^'ab' cap PM9f&H& *** t9 BLOOD- 
GROUPS as follows: The„-abon 'ab^ entail* a ..♦ah.oB 'ab'^^ifc^ 444 which a,,ow « 
rule(pO,top^agatt^*abtoBLQOD^ROUJP|^ ^ 

To prevents £E database from, cjtapojyiwlf w^ tpp jnany labels, jpme 
technique U needed to have only the Velevaiu" cfcjeft cla^ broadcast (heir label* during 
an inference,. One joluj^on.ii to consider an object da» tq^r^Yluit? (with respect to a 
particular inference) iff it gets labeled during that inference, Tl^u* an inference whkh has 
nothing to do with bipod groups will not cause- the bipod group p|»ject^<e.g. ' aD '> to 
jbroadcast,j^nc^of t^ 

Note jtbat Q£ "objects" |an be wwd, to represent a wide variety of database 
entities which are not "objects" in!he,iwi^£s#^ 

'ah' cljea-rly has no physical exii^nce. In 2-ac, 7 th* object* 'fea^ak'^nd 'male' are not 
particular female* or males, but are objects denoting ibese sexes tbenv«1ve$. rSince /female' 
and 'male; have U^.meaping^a univ«^,wUhou> f«m»4« ^»4. m»l«». th«re must be a 
strong coijnectioA between the chusq£ F^AJ^%w^th^ } object ternate'. The, u#xt 
subsection discuises what this connection is. _. 

2S The Binary Relationship <^*|raint - ;0 / 

This constraint allows the use of binary relations, such as "sex,-pf"j "father-pf", 
and "bloo^-group-of". Its network symbol U drawn a* ihown ip figure 2-ia, which states 
that the. binary relation ^X-OF hold*be|»ee^ and 'female'. 

Almwt all data-ba*e scheme* have some such cwiitrqct fojjaswgning "values" to 
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'characteristics" of objects * what^EVbinarf relitfOnititp* cotii&alif does i*& exMfnd this 
notion to encompass art«n%!^ssWin tiftffl&'&ttittlftt^ fitl 'Jane-Smith*. 

This extended ate if ihow in f ig^rir 2*tt/T>ie'meantng of it is that the cUjs 
*R' (the "range") is the image of the d^W^*ti&Mtil fliM W1ttftiyStti*ri : V. 
That is, # R* contain* eiictly 5 ^ise«ofej|e& i^n^ la*if^^^ ^Ws«<ne object in *D\ 
Figure 2-k uses tMi^s^-fliitfth^SiK^ all F£!if Afc^irt«na^tfettg^ ! n^4(bject 
classes for both the donliiri an* tWfttifr !l fipft ^4a ft s*^^r^n#"i^fcW all 
PERSONS UMther^feWkk'o^liaif. j ^ " ^ *>" "'"' "* i5 ' ! "* '•-"* "' ''•'• 

Note th»t *D' i« f i$u* 3149$^ &*> "domain of - r lKi' refttidh *B W in the usual 
sense of the word. Normally, the "domain of 's^MMr? »Wlal^^^t8^^Irf objects 
which are related 8f tf* rihmin to^ of the 

SEX<JF re1at§oo"taUi^ lit '•&*&&&*&*&** Used in 

this document/the ddmain tt~*t» tt^ btnWy 

relation The >-*milF then ^toii»i§* tJPirffc ; MMrf4mm& i ^ih&**mtn. 
FUitt«mo^W<foMtf ^ t^^nitatibnf to 

Anything. For e*arnple, figure 2^ w^^ 

by THINGS'. Presumably things such as ch«i^**liil^4M^^ 
contribute nothing to the range. The changed figure 2-4d would state that the image of 
THINGS under SEXOF is the class of 'mate' atfVinttf ^M^MfkiitM haVTno 
sex is irrelevant ; *'*' '^ lii ^" ,0i ' i '> {, ' i <fjen.;rf '"'■ <•■,;./ w: ;vy :..■■!« )hj.si?mo3 *;H'T 

' Note that theor^rtiTf^r«'2^alw irtpHtt 
person and cwnialtf^rsbn*^ 
in the rang* iralid^^#it!tt^ria«!ld%%6HM object Wttf ^Mtf'ti^fW^&M 
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must be a female person). If this existential commitment is not desired, the' expression can 
be made weaker as in 2~te. Tha one says that the SEX*OE aH PERSONS is contained in 
the class of the two SEXiS 'female' and !mak*: The chus *XX' «ay be any subclass of 
SEXES (including possibly the null clau, which wouldmean either that there are no 
persons, or that aU persons have no sex). A* above, it is not required by* the definition of 
"image" that every domain object (a per son) be related to at least one range object (a *ex) - 
one simple means of achieving this kind of symmetry isfivwtun the next subsection (2.4). 

One remaining issue is what che *B' das* rea^y Tepwsenti, and how it may be 
labeled. ExtensionaMy, a binary relation an bethought of a* acJau of ordeted pair** each 
of the form «d,r> where W is aW object in the domain, and V is an object in the range. 
So, for example, SEX-OF can be thought of a**ehiai H whs^aBOrrtah*» pairs such as 
<Jar»e-Smith^emaie>, <BiHy-Jonej^r»le>, etc This is jiw an eatemional way of looking at 
classes such as^JSEX-OF' - it is not attua^ ne«e$sa«f that the database contain 
(intensionally) a table of all the ordered pairs occurring in each btaary rebtion. 

These <o>^ constructs can be used for labelling binary jstattan classes. As with 

all labels, they may b« of the form: 

♦<d,r> The relation so labeled does relate 'd* to V. ; , -. 

-<d,r> The relation definitely does not relate 'd' to V. 
. -<d,r* The reiatioiv relates 'dUo V, and relam nothing rise. ; 

As with -obj labels, -<d,r> implies +<djr>. Also note that <d,r>j being an extensional object, 

can participate in all constraint propagation rules j«st like 'obj*- kaete car\ For example, in 

figure 2-4f, a ^some-child.its-mother^ label can propagate from MOTHER-OF to 

PARENTSrOF via rsule^l). 

The seven propagatton mks for the binary rtlatioa»Wp o»«raint are shown in 
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figure 2*5. They are 

(bl) If object 'd* i$Miie«to*»^, iM <d#» i* in the relation, then V is in the range. This 
it an immediate cot«qu«ct of the definition of "ifwge" gi*«n above. 
(b2) If an object V a in Che range, then U*«f# rowt tie ion* object in the domain which 
bears the relation to it It if not known what this objects, so * new name -wMt be used, for 
it<in this ca«, 'gOOST ii being used as tbt new name). Art* enokmentation of a CE data- 
base must contain some provision for generating men new objects. In the rest of this 
document, ttoesegeneraiedo^ by a 

4-digit number, For rujs?4b2), it isinewn that <he new object is in the domain (hence 
*g0037 is put on B), and thai it beats the relation *s¥fitO the object V fcence *<g0©37,r> is 
put on B). Some estarapie* betew show how this v*ork4 in practice. 
(bS) This is a consequence of #1). If V U not in the range, and - V **<b* the domain, then 
<d.r> can not be h» the relation, tf <d^ w^re in the rebtion, then .^ *oirid betin the range 
(using bl) r which if fabe. 

(bi) Thu i* a«other awequwoe of (bl). if V is not is the raqge, and <d,r> is in the 
relation, then *d' can not be in {the domain. Again, if W were4njthf domaw, then V would 
be in the range (using bl). 

(b5) If the domain contain* only the one ob ject 'd', and if «d#* is not in the relation, then 
'r* can not bo in the eange. 

<b6) Similarly, if the relation contains only the one ordered pair «d,r>, and the domain 
does not contain 'd\ then the range can not contain 'r*. 

(b7) Finally, if the domain contains only the one object 'd', and the relation contains only 
the one ordered pair «d,r>, then the range must contain only the one object V. 
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Figures 2-6 and 2-7 diagram some inferences using combinations of these rules. 
In 2-6a, assume that the data-base contains the information above the dotted line* which 
states that the sex of Jane Smith is female If a user of the data-base desires to know the 

■ ...-..:.■..■■ " . ' ■■■.'■■■■■ 

sex of Jane Smith, the user can construct the network below the dotted line. In order to do 
this, the user must have access to the class-points for 'Jane-Smith' and *SEX-OF\ but the 
user need not have access to the rest of the network. In particular, the user presumably 
does not already know about the binary relationship constraint (above the dotted line) 
which exists in the data-bate - otherwise the user would already know the sex of Jane 
Smith. 

In general, a user interacts with a CE data-base in terms of some fixed set of 
class-points. The data-base can be viewed as a black box with "terminals" (the set of class- 
points) with which the user interacts. In simple cases, the user ,,ca^ ; accea$ the da^ta-base by 
setting up an, initial labeling on some terminals and seeing if the automatic label- 
propagating inference procedure inside the black box, produces an informative result (in 
terms of label collisions). In more complex cases, the terminals might not directly express 
the classes in which the user is interested - in 2-6a there is no terminal for •tbe-sex-of-Jane- 
Smith*. Thus the user must construct the appropriate class (a new terminal) in terms cjf the 

■■■■■■'■ . . «. ■. i ■■■.'■■■■ . 

existing terminals. This is the purpose of the network fragroenrbelow the dotted liryt in 
figure 2-6a. In general, a user temporarily adds such a fragment to an existing data-base 
in order to define whatever new terminals are necessary for the current inference. After 
the inference, the temporary fragment can of course be^deteted. 

Having constructed the fragment below the dotted line In^-Sa, the user knows 
that the object-class 'x' (the name is unimportant) is constrained to be the sex of Jane 
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Smith. That is, the new terminal V is constriined to be a class containing a single object, 
that object being the sex of Jane Smith. Now, the^objeckia^ V i, known (intensionally) to 
be the sex of Jane Smith, but presumably there is ^already «c«ting object in the data- 
base (in this case, Yemaiel w*kh id eouivalem to V but is more interesting" (in that the 
user and/or the data-base know more about 'female' thin they do about V* The user's task 
is to set up an initial labeling such that the propagation procedure can be used to find such 
an existing object. '"' : ''' :s ' : " Q ' :: ' " t: ' i -^"' "■■ <! '-' 

An initial labeling fend the" subseouem ; mference) which does this is shown in 
figure 2-6b, which « a copy of 2-6a with the addition of labeU. The number in braces 
preceding each label is used to indicate the order in which the labels are created by the 
propagation rufes - these numbers ire only for ^venWce in I ^^rt* diagram and 
are not actually needed by the inference procedure. Following 5 the number is the 
propagation rutt used to a&elhi labe* so U^i$%^^ s %J^^ 
was created at time 2 via rufc<bi>. (Tne following English' dSr^ of how the labels 
are propagated is rather cumbeVsome; but it contains no more ^oV^ation Ihan the labels 
themselv* dorT^ ^ ^ -- ^^^ __ 

it says that the term^^twhlcn ^m&s&af ^s^'%^M stains exactly 
the object ■ V. The two libels numbered {1} are created by rule <b2) - the .* on V 
automatically en£h. V^ ^ 

object (here, g00S8) and create the new labels. The {2*1} -x label is created by applying 
(bl) to the two labels just created in step {!}. Rule (ol) allow, "female* ^broadcast an 
-female (label number {3,ol}) which coWdes with the *x already on Yemate'. 

As described in se^ion 1^2, this kind of coliisioii mean, <h*t V and I temaie V are 
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the same (extenjional) object. The colUiion an be nude known tfe the uter, who thus 
knows that \' (the «x of Jane Snutb) u jwiano^<rJWn>e for 'fepiAle'. Note that all the 
user had to do wasjp.cre$te the network.bek>w ,|he dotted Une .aj^,j^cn,.sjt up an irij^ial 
labeling. From that, the. bibel-propag fling . ^m^ : pttf^y^t^<^^»i that 'x' and 
'female' are the »me objea, and that cwcluiion w« given to thf y«r. -Al n£ Ui21£ did the 
user have to be concerned wijh the mucture,of nefw^aboy^ the dotted line (which 
represents the existing data-base). 

AH of this may sefm to involve aj^ex^sjyi amoi^B|t of effort for performing 
such a.*impje inferenc%but ex|c|ly th| same technique >c|n s bi,us|Cv v Jn\roo^e complex cases. 
Figure 2-7 diagram the inference that Jane Snath's female parent u Mary Sniith. 4g» in > 
aasurae^e strucjuret above the.dotj^ Uai.-a|$ below the 

line a^rf , cpnsttucted by the user for the purpose \ of this one, $Ztqai0> i^cfe^hat the data- 
base contains both "specific" data duch a* that Mary £**!$» JpJlfff ■. Smith's mother) and 
"general" data (such as that MOTHER-OF and^A£HJ?&fl3I >*§», **W, *«" i;«P of 
PA|$NT$<>Fj|. N ok also that not ay yf thf allowed label p ro pa g a tj o m are shown in 2-7 
-it shows only those Out ar* relevant to the inference. . 

The user-constructed piece of netwoi^ below *he. dgtted Une states that V is a 
parent of Jane Smith and that V is a femaku Thf question the user want| to ask it "Is V 
Mary Smith?" Note that this is a jimpltr qaetttoathan "fi nd ^h»obiea<i) in the data-base 
which are equivalent to 'x'" - such 7M* Uiference* (akin to figure 2-6's ^jnoV the sex of 
Jane Smjth") are discwwed in appendix A. , ,,, 

Having constructed Jhe lower piece of .^e^orl^f h£4tse|, must, now set up an 
initial labeling. The wa,y to infer tha| V is.^|0-$mU|C'M.|f jil^ime |h«t V is rjp^JWary 
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Smith and then see if the mference procedure "derives an inconsistency (via the collision of 
some "♦" and "-• labefr Td awbme that V 1*i^^ on 

the object-class Wary-Smith'. The reit of th« initial labeling consists of -x on V and 
-Mary-Smith on *Nrary-Shifa»\ These a* a^mar ked as number {0} on the diagram. The 
label-propagating inference procedure then um this initial labeling to derive an 
inconsistency - at point W therVis a collision of a ♦* and a £! rloiice * of this collision can 
be given to the user, who then knows that V is indeed 'Mary-Smith', since it U inconsistent 
to assume otherwise. (Another powible interpretation of this inconiijtency is that there is 
no such V, meaning that jar* Smith does hot have any ferrate parent Presumably the 
user assumes that she does have one,- so this interpretation is rilled but) : 

Note that the twaMs^^ 
the labels in a dif f^ent^ordef, the collfeion%«il hav* beerr^rAiie^fe^ sornewhere else. 
Thus the e*a« plate wnetfe'Hw itt^ is only 

important that it be cteectwi wmitwhtt-e - iU ' '* . ti " Ut - --' ;;; ' ; -' ;v 

CMie i^tteriaTsto^ 
generated-object would be placed W*n : obj^^ tii^ame^ tii-e&jfett*iis* 'is used 
instead of the geherated^jecf s flam*. For example, tfc* ##$ ^^ftl^mfeh^o label on 
PARENTS-OT is written 7 down iiwr^iateiy, instead of golnglhto^^he full process of 
(1) generating a triw^oojecfcfcere, *$&¥)! ft) propagating *' 1 fttf) *g!&4 fcibet to ^Jane- 
Smith' (arid a {2,b2f ¥^!2&,xV febe* t^ 

'Jane-Smith' via rule (ol); (4) Colliding the ♦gt234 with the* -jiiie-Smith ; 1*nlch then 
identifies the two object* <5> then finally using 'Jane-SmttTW all label* where 'gl234' is 
used (which is legal trtcattWthetwb objects have^been iclenlif fcd irtth eacV&ner). the 
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end result is that 'Jane-Smith' is Uf^Jn : pdMii^:^^\AiiHNri'fi|i%itMnr-l | ff , f«MO" "> 
have generated the gi234' in the first pi**. H 

In general this <sjh«rt«ut if legal because the genet* t«d -object V label will 
immediately collide wi|h Ibe V label en the ^t^^r^l^fajh^firtiMli . : |4|ena|^ri<R^ Jb^»^g««|eM*te«l 

object with tbp object-clan object Thut «jith«r may b* used io pl»c«pf tht other, so it is 
legal to use the object^lass.^aroe instead of the genemed^abjtct name. In this way, the 
generated-object name never appears on thediagjara. Thu shortcut w 01 alio be used 
wherever appliqible in all the rest of the example*. 

2.4 The Inverse Constraint 

This construct constrains binary relations to be ^efj^fts^ ; each other. For 

example. PAR I^T^-QF and CHJM^ if 'p* »s a 

parent of V, then V 4s a chjld of *p\ Th*ifetwoj* i »yB^f^«tb|s J if» given in figure 2*$a - 
since "inverse" is sy/nmetrjc,^ does nor matter ^T^rejatb^ of 

the symbol. The three propagation rule* f4ar, the inwift cotutraim ajje *hown in figure 

(invj) If ♦<d£>,U,on©«esideQ 

(inv2) If -<d,r* ison one side, put ^ j^4> i o^tb§ojhe^ w XFj^e^amp^if *p' js nptaparent 

of V, then'c' is is n^-a.chjJd of *p\) ,, -■■ q} ' 

<iny3) Finally, if -<d/> is on one side, put -<r,dK on the other. 7 ^ ^ 

Thing* may be said abeut.abjnafy cnJatjafl ® » *ymf(«rk nwiner by using both 
the relation and iu inverse. For examplMJ|W# ?#£^ 
Like 2-4e, it states that the SEX-QF all P^iQNS » sow syt^as* <htrt -called *X^') of 
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the SEXES 'male' and Yemttel As*U«cu*sed;ilt section 2.3, figure 2-4e does net <ttnpty that 
every person has a sex. However, 2-9a does ^npfoiftis. lSihc#yfilteQNS friheSimage of 
XX under Bltt#GS*OF'S£X; , fbr eve^persojttHei*«wrte^i«i^o^jeet in XX such that 
<the-xx-object,the^persohi*«is W fc£»i&BJ©F-SEX <t*si«g mkbiY Tto^\^s* constraint 
in 2-9a then forces that <rhepersb^ftN*xx^jett> 'bein^^EX^^.'-'^ICfteWYlM^ehBh^lMi 
a sex (which is *thexx-obj«#5l Nol* that none of this requires that a person ha ve one and 
only one sex - this fopfcef *W f tfrwttdni" •£ dlicuisedViri dettilinl^ion^e. 

As another example, figure 2^ states rhat ? SPdlJSt*dF?ifts own inverse, 
which is means that it is a symmetric relation. 

: 2.5 The V^kfConsirihlt l1 °' ■'• i "-- i ■'■■'■ ; --' ; ' i;;i - : *'•-■■■'• : ' J ^' mikm:- -:>i ; 

So far, we-haVe»iuWecr>i!fiaf ^1^M»mkMlA>if^4iiiWy ednitaiit bVer time 
and ■T^'i9e^ : ^iM^ f, p^^^ii»^^m : ih^ has i^iiMI 4a&4f aWir^ha*»the 
extension of -a class tuch ***IMlST£fe£f>V©TiERS'fr ife^«l»'W*^det4h«^ However. 
in the rear "world mahf tti^^emge'W time: Wii*m!mmfem^#8&*<>TtRS 
gains and loses objects as new people register and okt ones die or let Iheir registration 
expire. Also, tr^^te*s%IWS cT^ 

which is taken. For**x*i^;f soii^ all 

DRAFT-EVADERS to be CRIMINALS, while other uteri rn^tw^P " ■ 

Fortunately, ibef#% arfralher^iimple .mechanhm which helps solve both these 
problems, and which wW8Sme to I* Wtatf-itfiifloteM'rilM* M t#llb*^sections of this 
document. The basirttk* il'fc Support moltipfe ,, wor«s'' in rh#^ite^k A world can be 
thought of as a physical 'br'#^p1iys^<^tidff. s&has^the^ hy^cat^toaftoiifof "April 3, 
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1946, at 10:23 am", or the metaphysical situation of "what Jane Smith believes to be true." 
The latter might be important if Jane Smith is a data-base user who wants to add 
information to the data-base that conflicts with information that some other user wants to 
add - the data-base should then be operating in different worlds when it is being used by 
Jane Smith and when it is being used by the other user. Section 3£ discusses many. other 
such applications of worlds. 

2.5.1 Extension 

To support multiple worlds, it is necessary for the data-base to be able to contain 
assertions relative to the various worlds and to make the appropriate inferences from them. 
Since the inference process deals only with propagating labels, the inferences can be 
relativized by tagging each label with the world tha? created it. For exarnpje, if V is the 
world of "April 3, 1946, at 10:23 am" then the label "♦x/w"wUl demote, a, tx^bel relativized 
to that world, and similarly for k -" and V label*. For. example, haying a ♦Jane-Smlth/w 
label on REGISTERED-VOTERS states that Jan* Smith U a registered )foter in wprld V, 
without making a commitment one way or the other as to whether Jane Smith is a 
registered voter in other worlds. 

For all of the label propagation rules, given so far, it is necessary to add the 
stricture that two labels may interact only if they are tagged with the same world. For 
example, in figure 2-lOa, rule (p4) can not propagate a -x to class 'A', since the -x labels on 
*B' and 'C belong to different worlds. A|so, every label which is propagated during an 
inference must be given the same world-tag as the labeK*) which caused the, propagation. 
For example, in figure 2-10a, if point *C were labeled -x/w, then VV' could be labeled (using 
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rule p4) with -x/w - the V world-tag being required to relati vize it properly. 

Furthermore, wortd-tigs must be taken into account by the object constraint's 
rule (ol): An object-class such as the-presidem-of-the-US' represents the same intensional 
object in all worldt, even though extetitionally it may be different (there being different 
presidents at different times). Therefore if obf is an object-class, it is allowed to broadcast 
an -obj/w label for any. world V. As in section 2.2, the data-base can avoid choking itself 
with spurious labels by having an object-class generate an -obj/w label only when the class 
is reached by some existing label with a world-tag of V. 

By convention, the world-tag used on the initial labeling which starts an inference 
will be 'inf (an abbreviation For "current inference*), and every label written without an 
explicit world-tag will be implicitly tagged with 'inf. 

In addition to using wbrloi for tags on labels, it is desirable W be able to refer to 
them explicitly as objecS. That is, 'inf, V, and lit other worlds will be treated in the same 
manner as other CE objects f jaifte4mflh\ female*, etc). For tftdbrnp^# 'w.l' and V2* are 
both worlds, then the label *w.l/w.2 on some class means that the wWta*objeet VI* is in the 
class, relative to world *w.f . The use of such fabeft is dijcuued below; 4ot' now, It is first 
necessary to describe what a world-object such as VI* "really U* - it is dearly not the same 
sort of thing as a person (e.g. , |ahe*Slnith , ) or "a tex (e.g. ¥ewiaiei(. Since a world is a 
specification of some state of affairs (physical or metaphysical), and since the CE universe 
models all information in ttrms of aligning object! to clawes, it follows that a world "really 
is" such an assignment. These assignments need not be exhaustive - the world V might 
assign 'Jane-Smith' to be in REGISTEREI^VdTIRS, assign *J0hn*Sm1th* not to be in 
REGISTERED-VOTERS, and make no commitment one way or the other regarding 
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•Mary-Smith'. ,,,. 

For consistency with material tp^e pr«ait<d txlow, » wotld-object should be 
viewed as representing a set of. ."allowable. asj^^ if ^rW *$'. makei no 

commitment regardjng^the augment of 'Mary-Smith' to REGISTERED- VOTERS, then 
the set which V represents will contain (as allowable assignments) bo t h^ "Mary-Smith if. in 
^IGIST^REp-VpTIRS; and "Maiy-S^th & nj^ ^ |i^|f ^||l|P-y OTERS." In 
brief , the i world V allows bo|h,._ppsfjbl^^ 

means that if one assignment-set is a_ subset of another one, then the smaller assignment-set 
corresponds to the stronger world (in that it has fewer cases of allowing both possibilities). 

Now, having given some solidity to the rather philosophical concept of "world", 
consider again how world-tags are used in labels. A label relates an object to a class — the 
label %x/w" on class *C' means that the object V is contained in 'C relative to the world V. 
Another way of stating this is that in world V it is Que that V is in *C\ Now such a thing 
can be nested: Consider "in world w.l it "U true thai in w.2 it is true that V is in *C'." (A 
less awkward but rather anthropomorphic way of putting ^U ^"wA believes thai w.2 
believes that *x' Ji in i C) This nesting will be represented by using the label "♦x|w.2/w.l" 
on *C' - the object V,to relative to world V^ / wMe^ i to^ v tij^ ; ^^ye^W v The main 
point of this paragraph is that the world-objects used to relativize objects in labels may 
themselves be relativized (to any depth), giving rise to ari?itrarily long world-tags. 

Like other objects, world-objects can be grouped into cfcuse*. By convention, such 
world-classes will have names beginning wi|h "W-". For example, the world-class 
W-CATHOLIC-FAJTH can be defined to contain all worlds which are consistent with 
orthodox, Catholic religious, dogma. There may be many different world^objectf t in such a 
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world-class - fof instance, in W-CATHOLIC-FAITH two world-objects misfit agree oft all 
assignments except those Involving SAIW^AIJi^-BldO^-CSI^dtlF (f.e. these two 
worlds assign Paul's blood group atffasrfllyj! ^ihWtajfioil dogma -'tiffin no position on 
what Paul's blood group is, both these workJ-objecU are cOWittsenr with the Catholic faith 
and thus both belong in the world-class. Extensionafly, W^C#THOLIC-7AITH will 
contain a great many such worW^jem, differing with «(^ other regarding inessential 
details. However, all these worid-ob jects will agree on those details which are important in 
Catholic dogma - that a divine Christ existed, that Mary was at virgin, that the Pope is 
in fallible, etc. "*" r ' :: ' sti;;: "~'" , ' •'■"■■'"'•■* 

2.5.2 Intension - 

As with other classes, it is not necessary that the data-base contain an exhaustive 
extensional listing of all the world-objects in a world-class in order for the world-class to be 
useful intentionally. For example, figure 2-10b states that aft worlds of W-C^TfiOLIC- 
FAITH are also worlds <^ W-JCf&EO-CHRISttXN-TAttR (which is meant to represent 
all the beliefs held in common among Judeo^hrittUn rehg^ 
Commandments, etc) Note that the Mjiajfw class (in* terms of the sdtkJass constraint in 
2-10b) is the stronger one $h terms of whit is befieved): The Catholic belief s include alt of 
the general Judeo-Christian beliefs, but not. conversely. A more general way of looking at 
this is that every situation (worid-bbjegt) which is consistent with the "beliefVof the smaller 
(stronger) world-class must necessarily be consistent with the beliefs of the larger (weaker) 
one. And indeed, a +w (or whatever) label can propagate from the smaller class to the 
larger : Vii ' rule (pi). Note • triat ? the' use of "welker" ' aW'*sf^oS§er** ; -A 1 (wltri ' regard to 
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classes of world-objects) parallel* the above use of these terms to describe, the assignment- 
sets for individual world-objects, The remainder of thii Mction deafer only with world 
classes -assignment sets are taken up again i»sea*Of|Jfe% ; 

Now, just as wodd'Ugs are needed in labels to rehtfivize them during inferences, 
some constraint expression is needed to r«Uti*Me,in|:«matiop4n the data-base. This 
constraint expression is |he "world constrajW," the netwrk sya^b/Bl for which is shown in 
figure SHOc Its meaning is that *B' is a subc|aii u of %& exadfa those worlds which are in 
the world-class *W*. {Section 2&& shows how constn^ttinadditiSP to the subclass one can 
be relativiwd.) 

A simple example using the world constraint i$ shown in figure 2r!0d. This states 
that the Catholic faith' believes that Mary is a virgin. More formally, it states that every 
world which is consistent with, the Catholic faith is neceuarily one of the worlds ui which 
Mary is a virgin- The subclass constraint used in2-KW bj^ 

and W-VIRGIN-MARY is needed because there might be woxkU in which Mary is a 
virgin but which are not consistent with the Catholic faith (such as those worlds continent 
with Protestant faiths that assert the virginity of Mary but deny the infallibility of the 
Pope). That is, the Catholic faith imposes stronger constmiirtt than hat the virginity of 
Mary. , c ■■..-.-p.* 

The propagation rules for the work! constraint are as shown in figure 2*11. .The 
7~" at the end of the woikHags indicates that the exact nature of the rest of the world-tag 
is unimportant. for these rules, and that any "tail" may be uniformly substituted for the 7-". 
The propagation rules are 
<wl) |f world V is contained in 'W, this means by defmttionef the world constraint that 
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'B' is a subclass of W (lit world V). Therefore, if som« -object V is contained m*& (in 
world V), the object tfttittt«lto be contained in W <iT4iMttt$»9^ Itf^her #brds^ the *w 
label on *VV "enables" the sube&s cwHtralrtt for aH objects >*m w&W&p V, in* this 
enabled subclass awstraint elates in th^sam* mammas rt^pl> ■ - < 
(w2) Similarly, the enabled strbcfass -constraint operate* hi a&e* same *narroef *r f ule ip2>. 
rf x/w ishbttntMaiper^^ 

(wS) This rule appHw when 1 world Ssftintttt Is «x|li<^ d^ibWby a -w lit»el on *W. 
Rather surprisingly, HKh nef^vt tnfOTftia^ion canrbe used MR&v&n&th*- (instead of 
just passively refusing to enable the subclass constraint, which is what havnig fro label on 
•W does). Since the ^lalSiVmean^thaVin weiWI *&mMb&W'ii gjj&'ft subclass of 'A', 
then there must bVsttn^ Wjfkt (relative to ?w*)h»W8h ft*tff^i*ir rio****^. As With 
rule <b2J; it is not tabwrf witIcW?ex^ibfH$ obprtyi%nr tfttgli Wstf a* generated object 
is used.' In tRft-Wife?NM «ps>/w 

: labeUnd ; co^rain^h^^Wtt%^th^^«fi^^^ 5 Y3ANhtf^8r< ■: 
t*4>'-lf iriHifejfc&ltf *^ Jn^^nb^^h&^f Ban. ^oc be a 

subclass of 'A* in that w»ld - fuM '*> <<:i\v-fi; M: ,-..;;;■« utti:. sdjjsi in^^or".. - 

(W5) If *B l contains eractty^ object #(& wbrfi% , )l%ittl #<MlM%lftJlfct<ta c#ftiiin«tf ittfc 'A' 
(in world 'wO, then 'A' contains all of V (in V). Thus 'B' is known to be a subclass of *A* 
in world V. - -r-^sU u; <sik .m^iUno^Jiiio.--' -:l: <<o; -fi,r-- r^UK^qo;q ?nT 

Figur#2*$ sfcdw* an infwante usm£ K^oT these «ikfc ^%s#3u>tb show 
that in air worlds whertf^i*.*3ttbe1e^of^^ iubclass 

of *Q\ Although it seems trivial, this example does show hew the propagation rules 
interact *WA* is the wetUNibtt for tbVftrst "P U iW&mW'q$>m&*V!&W the world- 
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class for the second on* The task iinw » ihew that wtffwetl&miWAf U necessarily in 
*WB' (U. that 'W A' u a subetau of MV). As described, in section *4» the usual manner of 
doing this is to assume that *WA* is. got * wbch** of W1& > Then s«r* objict V. is in *WA* 
and not in "WB* (and Us world-tag U 'inf since it is starting the inference.) .After the user 
sett up this initial labeling, the inference procedure am start propagating. The *x/in| on 
*WA' can do nothing for the moment, b*it tht -x/tnf on *WB' dm, Rule (w3) is activated by 
it and thus can create the gentrtted-ohjrc* taheh which assert that *W A' *» r>£ a subclass of 
•WB\ Then rule (wl> can propagate the generated ; V *bel through the enabled tef t-nand 
world constraint, causing a collision at point ^ which teltcaies an frc e m frte n cy. Note that 
rule <w2) could have been used instead to propagate the. %i&&y*Xk-imi& Mve caueed 
a collision at point ?'. 

2& The TypicaJ-Member Conuraim ,,', 

This constraint allows the cfctt-bue to n^ u*e #* "typkal rotmber" of a class. 
Consider representing the iof ormatton that "each person has a. tuuque mother." It w easy 
enough to stale for anv otrtkuhi person toch as lent Swsth that she has a unique mother 
- thU is shown in figure MSa. Thus one wa, to r s pt naa t 'each person has a uaioue 
mother" would be to create a tmKttu* sin^h* i© figiw Wla fee each ^ 
Needless to say, using this approach is very cKpensive if there are aavst than just a few 
individual persons, and it has the added disadvantage off r aat n i iin g cewptete extensional 
infornuuion concerning att the roemben of the cbtt 

What the tyfricataeraber eeruttaiat at*w* is fe aaeia^ to state that the appeal 
person has a uniquemother. Note that ^ye*eT w used aaee aa tM *«^ 
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and not in thesenseof *vamf ~~ att persons have a unk^ mother, not J«»t w»tt of t*»«m. 
The network symbol for % simple version of m< typ ical n wm to er for Vm"^comtraint is 
shown in figure 243b. T*ii states that the object 'sbf is a typical immber of the class XL'. 
This constraint is used ift figure 2-l3c to stale thai <v»ry peM0nh« * unique fnother.*' In 
the following text, the point *&U **tt» %e referred .vitrife &W*i*p*t class" of the t-m 
constraint, and the point «%bjkwfli he referred to*» the %m eU^Bel p ; 

Intuitively, thet-m constraint shooW na*« the f<J«Owtngh*havior: When some 
"♦" label (such as + p/w) occars on the input dkil, k b know rhtt y is in that class. 
Therefore, what is true of the typical member must be true of *p VTheeefOW, the t-m object 
can be "bound to" *p* -"by putting an -p/w label on the t-tn object Then this -pfw can cause 
further propagations. 

The one problem with this behavior is that only one object can be bound to the 
t-m object at any given time. Violating this restrietioj* ^ <a<^ sev**pKfclems. For one 
thing, if )pi* ■&&*$# ve.im ^'^Miimi^^-lt^m^Mmmit^^ *pJ/w and 
♦p.2/w labels oh tfes£l^f cMtiHft^ ibeh binding them be«i Ibti^^ll^ It the same 
time would imply that they i»1ftJb#ti^>itofcDJ^ 

input class wcwld b* Jimuft*n*ou5]y W^ttH^ »h figure 

2-13c 'the-mother' is obvkwsly not the »me (extenikmal) object for every person ~ it too 
must be ^und* m some manner; «i -< '- •;- •■ «- >r^ ; 

Although *h» "one binding at a time" restriction avoids these profelems, it has its 
own difficulties. The most obvious 4» that it May blf necemiy ^T«f«r fwo^<d»* more) 
different members of the inptH dais during a single inference. ' Another difficulty is that 
some facility is needed for -*Mmtf alltb* consequents of One binding (j*. alt the labels 
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which that binding caused to be propagated) before performing the next binding. 

One way io solve this "erasing" problem i» to tag all the labels which propagate as 
a consequence of etch particular binding. The^H, weirid be a stonptemaKer to identify 
which labels should be erased, since these label* would be the ones with* the tag. The most 
stsaightfowiNu-d way to implement such tagging is in terms of she existing WOrW mechanism 
- each different binding can be tagged with a unique (newly generated) world-tag. Indeed, 
if this world -tagging u done then it is no longer $eeesj$ry to keep the i "one binding at a 
timel. restriction at aH- For example, the above ^.l and -p-2 labels which come from 
different bindings (at the same time) can not interact with each other because they will 
have different worjdrcags - they will be something like «pJ/w2034 and -p.2/w2035. Thus it 
turns out that the "one binding at a time" restriction can be reptoced- by a "one binding per 
i«orld" technique, which avoids the dlfflcukies caused by the sequential nature of "one 
binding at a time." , 

To properly implement "one binding per world" it is necessary for the inference 
process to keep a. record of which "parent" world es^h binding's wo* M-tag relates to. That 
is, it ♦pJ/w on PERSONS causes a binding of -p,U*20M to 'the-person', then it is necessary 
to remember that 'w2034' is really just a copy of V, with the additional restriction that 
'the-person' is bound to 'p.l*. This being the case, 'w20S4V4abtl| should be allowed to 
interact with V ones, even though V2484' can not interact wKh other worlds in general. 
(In particular, *w2034' must not be allowed to interact with worlds which represent other 
bindings of 'the-person'.) 

More formally, 'w2034' is a stronger world than V -- J w2034' disallows the 
assignment of anything other than 'p.l' to "the-person'. This being the case, the stronger 
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world-tag may be validly substituted for the weaker one. -This U valid because if an 
assignment is forced in the weaker world (as indttated by t h e Jpf e sett ce of a fatbel with that 
world-tag), then the same assignment mutt be forced in the stronger world (since this world 
allows fewer arbitrary assign mena). Slree me a«sre>ffl»ent isfowad in the stronger world, 
that fact may Iw ifKlkated vm a label with the stFW>ger world as its world'ttfg. For an 
example of this, in figure 2->3d the -x/w on 'B' and the -x/w2Q94 on *C' can interact (via 
rule p*> to propagate a -x/w2C34 to W if 'w2»M rtronfer truth --V- This happens 
because the -x/w can h*ve -K/w^OM $ub«rtuted for it, which allows rule (p4) to propagate 
the -X/W2034 tb 'AV >.*' ■ '•"-"' ■;=-■•'•'• W \ : -'V ''* 

Since t-m constraints may be nested (as in an example below), the process of 
generating a- stronger world for each binding can give rise to a multHayer free; of worlds. 
It is a trw and n« just a Imear chain bwause a H^ 

different binding^worlds: figure 2*13* shows the (one»kyer) free fo«lht^xa*np^ibo^e, in 
which Sv2054 l and V20^ are both bindings treAted f^^Ff%^fmtfaict' of nesting 
allows sucftiyew*tfbe ; m ea fr in the 

tree » strongetthan alMfw woiidi ^fcbove" it in the trei^M. its parent, ■"*» patent's parent, 
etc.) ■- *'• - ■■ - * - . ; -- ■• 

As an example 4>f« nestings if a ♦p.l/w2694 label occurs on the*mpue-ciass of some 
other t»ro constraint ^e^^ 

•pJ/w20S3, where ^VdMHB^b^ new, world. Figuw>lSf shewj^he "worid tree" ex tended by 
this new binding: The new world V20S7.has *w2034' as its ^tmi^mimjti&imekmtk^ as 
its parent. In general, a^wocU s^Eh as:w30S7' is sJlowed to interact with; alt the worlds 
Above it in the tree (here, V2D34' and w^becsuse all tirtworioU above a given one are Just 
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weaker versions of it (they lack one^r nwfb|pjling|). v 

In order to handle nesting properly, i| is necessary to >,k$ able tp explicitly refer to 
the world-tagi generated by the binding process. F(gu#>^i|^ t Jhe full version of the 
t-m constraint In addition to the input class and *hejim^jacjt,,# hai an "input world- 
class" CW-IN*) drawn on the input class iide, and an "output ^tfcclass" OV-pUT*) drawn 
on the t-m object's side. (The in and put worW-claMn max be drawn, ^tKer above or below 
the centerline of the t-m constraint symbol - they aje djygngujshe$,oojj bj wWch side they 
are on.) The world-tagi generated during the binding pro^i* are put on the i W-QUT class 
- this is what makes it possible to use thine generated tags >n pfber constraints. W-IN 
exists primarily for efficiency - it acts as a "filter" in that an object in the input class (such 
as indicated by ♦p/w) will be considered for, binding 0fg b£ U in W-W, That is, the 
meaning of the t-m constraint is that the t-m object is typical of the input class relative to 
the worlds in the input world-class. If an input world-daw is not specified for a given use 
of the t-m constraint symbol, then iiapp|lesJ©a^wp|'Jd^ 

The two label propagation rules fpr ^hft t^ constraint are stepwn ift figure 2-14: 
(tml) If an object is in the input class and (^object's world-tag i* in the input world-class, 
then a binding can be created (as discussed aboveX and the binding's newly generated 
world is propagated, tp the output worjd-class (if any). Th^e binding^ worW is also noted in 
the world tree. (The diagram show* the world tree both before and after the new 
binding.) If there is no input world-class, then any ohjact in the input class can be used to 

create a binding (regardless of the c*jec^sw^u«^ , 

(tm2) This is the convert of (tml): If a binding jJom exist, then the object which » bound 

to the t-m object is necessarily iii the input das*, and the wp^w^ich, caused the binding is 
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necessarily in the input world-class. The essence of (tm2) Ite in defining exactly when a 
binding does exist. Referring to figure 2-14; there are Preconditions which must hold. 
The first is that some object (here V) must W bmind totheH-m object. The second is that 
the binding world (her* *tfflJ¥or V mtfct W^M^at *w©f$^ass: Tfe third condition 
is that the world (here *w:i'> which "believes* fhat : V2* is in the'output world-cl«s must be 
above V2* In the world ^ree (the dotted line between Vi* and *w 2' in the world-tree 
diagram indicating that 'wr* need nbt be immediately above '**'$i that is *»$ must be a 
stronger versidn of VP fas created by some oindlng); l^aleh together, these three 
conditions define what a binding is - it can be seen from the top i'&tf of figure 2-14 that 
rule(tml) satisfies these conditions when it creates a mw bjiwlttig' ffete that (tml) stales 
that the output world*l*ss remains alt the bjn^ng-wollds; '(tm^ state* that It contains only 
the bindkig-woWis:- ' i! ■'■•-■ sV "-' v u.sn:'-:>j nr* -- -^ 

Figure Mf5 shows' in mtfh-eiice using both #*l) ant W§£ v ' f ne top part of 2-15 
sates that tfaz\*u otmm&kkl&TS t<tiim of 'exa<$y%ose 'per sons who have 
children who Hav* chrWrefl. 1T% iee thkt this is f the ;; caf£^lerWwrk-ctass 'WX'. 
One one hand,, WX is dtfined-to M , ^'ou^t I WtS4si^'r : o^''l^ , t^^nmW involving 
GRANDPARENTS. -Front thit constraint; WX contains &attiy ! flioV wofi<fs In which 
the-person' is bound, in i ^orne"grand|ttrerit On tMother^i^/vt^'J^iiW^'ihetl to be 
the output worW-difss for ^^^^hV t^fcotistraint involving THfi^RANlXiHiLbREN. From 
this constraint, WX^tafof <ex*tfry tfobse worlds in which ^th^grandchiicr K bbtind to one 
of the grandchildren (and hence TM^IR/WDCi^fttt^ < -h : n^^ > «j^ty class). Since 
the same output wbrW^dtts WX ituwd for bdth these ^constraints, it means that every 
world in which ^he-grttrdcrnTa' ft boUhd to one of THE^RANt^rXPRElN is also a 
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■■»■.■ 

wofld in which •Unperson' i* bound to one o| 4b*<i^ri!ftJ?AR£NTS (and vicesversa). 
Now, since THE-GRANDQHILBJUN are in fact constrained by the two binary 
relationship constraints to be the grandchildren of , the*pea©n , n it follows that *the-persan' is 
bound to some grandparent in exactly those worlds in which *the-person' i* bound to some 
person who has a non-empty class of grandchildren. 

To see how this work* in practice, the middle part of figure 2-16 contain* the 
information that jane Smith is a child of Mary Smith. Now* *wpp9*« the ipwe*;part/0f 2-15 
(below the do^ed line) represents new information to be added to an existing data-base 
(above the dotted lintX This new informaUon stales thatllitty Jon« U a chiW of Jane 
Smith, Now, from this new information, it should be possible to Infer that Mary Smith is a 
grandmother (even though that the new informatiao mattes jH*;f£fe§eoce at all to Mary 
Smith, and indeed the user who adds the. information need »H»t|^en know that Mary Smith 
exists). -,...;.-.-. >i* J 

The inference is started with a {0} -Billy-Jones/inf label on Billy- Jones and a 
{0} -Jane-Smith/inf on Jane-Smith. By step {4} there are labels on CHILDREN-OF 
stating that Jane is a child of Mary and that Billy is a child of Jane. At step {7} Mary is 
bound to < the-person' (the typical member of PERSONS). From there, two applications of 
rule (bl) yield the fact at step {9} that Billy is one of THE-GRANDCHILDREN of Mary 
(who is still bound to *the-person'). From this, Billy is bound to 'the-grandchild* at step {10}, 
and the binding world 'w0002' is put on the output world-class 'WX\ This world interacts 
with the existing binding of Mary to 'the-person' to propagate (via rule tm2) the 
♦Mary-Smith label to GRANDPARENTS. Note that in order to apply (tm2), the 
-Mary-Smith/wOOOl label on 'the-person' must be treated as though it were 
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-Mary-Smith/wOQQ2 - this is legal because *w0002* ts stronger H»n "wOOOl' and thus VOOO? 
world-tags can be substituted f or V0©@l* ones #e^%adN* f fare 243d for a simpler 
example of this kind of interaction.) 

The conclusion at step {»} is that Mary Smith it a grandmother in world 'wOOOl*. 
This means that Mary is a grandmother in Inf, *tso. Consider: If Mary were not a 
grandmother in-Tnf, then a label to that effect &e. -MarfSrtifth/irtf) could be placed on 
GRANDPARENTS without ia*y inconsistency. However, such * febef can mtetact With the 
existing *Mary*Smith/w©eoi<siace VO00T U stronger than "rnf and tah thus be substituted 
for it) to cause an inconsistency. Thu§ Mary i* indeed a grandmother (in 'thf*). To put this 
another way, die wor# ^weew dtf fers from *ihf } om> in the presence of some additional 
bindings. Therefore aU labels which do not ref# to these bindings are as valid in *lhf fe as 
they are in wOOtH*. After an, MrOOOT is really only a bookkeeping device used to prevent 
invalid interactions with other possible bindings of •the-person', and it otherwise completely 
equivalent to *inf. 
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3 Using the Primitive Constraints 

This section describes some of the useful non-primitive constraint expressions 
that can be constructed from the primitives of section 2. The structure of this section 
parallels that of section 2, so that (for example) section 3.1 describes some uses of the 
primitive constraint introduced in section 2.1. 

3.1 Using the Partition Constraint 
3.1.1 Taxonomies 

The simplest combination of partition constraints involves arranging them in a 
taxonomic hierarchy such as figure 3-L This allow* * great saving of space, since (for 
example) it is not necessary to have an explicit subclass constraint between REDWOODS 
and PHYSICAL-OBJECTS - it is implicit in the structure of the tree. Indeed* if V is 
known to be a redwood (as indicated by "♦x/ r " ''M..R^^^$^ : .tihim^V.|a: known to be 
a PHYSICAL-OBJECT "(by applying rule pi four Umes). Hierarchical structures such as 
figure 3-1 are used in many kinds of "semantic network" representation* ~ part two of this 
document compares some of these with CE 

3X2 Intersection and Union 

Taxonomies such as figure 3-1 have a highly disciplined structure ~ each class- 
point occurs at most once as the superclass in some partition constraint, and at most once as 
a subclass in some other partition. By relaxing this discipline somewhat it is possible to 
represent all of the Boolean functions (intersection,, union, con^letnent, etc) Intersection 
and union are treated here - complement Is treated in 313. 
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Consider intersection: Given two classes 'A' and 'B«, it is desired to constrain the 
class XT to contain exactly those object* which are contained in both 'A' and V. Figure 
3-2* is a first attempt at this. In *-2a. XT i, a subclass of both 'A' and W, so every object in 
XT is necessarily in both W and V and is hence in the intersection. Thus XT is no larger 
then the intersection. However, Iff may be a good deal smaller than the intersection - in 
the worst case, XT could be empty (in which case all of 'A' would be in Vtf and all of V 
would be in 117. What is necessary is to prevent those objects which belong in the 
intersection X? from mistakenly ending up in 'AT or *I\ Now, if «AI' were constrained to 
be mutually exclusive with T. then nd object in 'A' which is ate in V could end up in «A1\ 
Hence XT would have to contain a&thi object. mthVtetersection. Figure 3-2b adds this 
constraint that *Af be exclusive with «B\ a. well as the symmetric one that TH* be exclusive 
with 'A'. i 
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Figure *2b does indeed constrain XT to contain >alf of the intersection: If 

object V is in both 'A' and V, theh it should be possible to infer that V is in XT. That is. 

starting a <x labef from bo* «A* and V should r«ulfi 8 a * on XT. Referring to S-2b. 

such a *x on V would entail a -x on 'At by rule (pi). Then the *x on *A> and the -x on 'AT 

would entail a *x on XT by rule <p3J, and we are done 

Having represented the intersection function, it turns out that the union function 
comes for free. Consider the rfatse* 'AT and 1? in figure j-2b. «A2' contains all of V 
plus all of *A' which H not in V. That i, to say, 'A2' contain, exactly 'A' union V. 
SIMM* Ttf contain, exactly**' union V. Since <A2' and Itf are extenstonally the same 
class, nothing fs lost by using the same intension^ class for them: Figure 3-2c does this and 
names the combined class V. Thus in 3-2c the da» t? repraents the intersection of 'A' 
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and 'B\ and the class *D* represents their union. , 

Since intersection and union occt*r fairly frequently, it can become awkward to 
have to reputedly write out copies of f igure fr% A* a more convenient nation, figures 
3-2d and S-2e show the abbreviated network symbol* for, interjection and union, 
respectively. One way to view them is as "macros" for the full structure of figure 3-2c - in 
an implementation, the union and interjection boxes would be expanded into the network 
of figure 3-2c,and tfja£ would be wjiat is stored in the data-base. 

Another way to view them is as "subroutines" -instead of being expanded, they 
can be implemented as new primitive constraints, which have a specified input-output 
behavior (as determined by propagation <ul«) without any regard for what "fine structure" 
the boxes might have, Not only dee* thu *v« space <j»c« the boxes are not expanded), it 
also saves time during the inference process - the specialued propagation .rules for the 
intersection box (for example) can operate direcOy in terms of the classes 'AV'B\ and 'C\ 
without having to worry about the internal state .of such points a» 'AT and 'Bl\ Of. course, 
adding a new primitive constraint doe* inaewe the coo^lexity of the inference process in 
that it adds more propagation rule*. In general, the decUton as to which non-primitive 
constraint expressions shoujd be left a* macros and whichts^ldberriade into primitives is 
a matter of implementation tradeioffs. For the purposes of the res| s of section 3, it suffices 
to note that just because some constraint expression is called "noncurative" (m that it can 
be represented in terms of the primitive constrawuts) does nor mean that it must be 
expanded into a (possibly large) network of primitives in an actual implementation. 

As an example, figures 3-3 and 3-4 show the propagation rules for the intersection 
and union "primitives", respectively. The behavior represented by each of these rules can 
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be derived from figure 3>-2c (for the cue of two tnputi *A* and *BV For generality, the 

propagation rates are given in term* of boxes which ctt-1iiwe#ioi«fh«tl two inputs - 

these N*way intersections and unions can be modeled in terms Of a cascade of two-way ones 

if it is desired to reduce them to primitive partition comtrainto. The rules for union are 

shown in figure 3-& 

(ul) If an object is in one Of the inputs, the** ifmdst be-to th^u^icst; 

<u2) If an object is not tetbe urtten, then it can isaHffr ihfafi^bf the inputs. 

<u3) If an object is m the union and it not in a«-buH»e of the inputs, then it must be in 

the remaining input. j 

(u4> U an object is not In any of th*4npots, then it e*a not be in the union. 

(u5) If the uaiatr contain* wactfy ow object; ^ml if th»tobj«t is. aot in aiWnit-one of the 

inputs, then the reina^ing iiiput rrost comaM «»«rf thai object 

Note that m tbeowfh (u# »e iiomorpWc to <p$ through ^ except that (pi) 
also makes use of the fact that the subclasses are disjoint -Th» cutis tor intersection are 
duals of those for union (a* might be expected): Tn% are shown in figure SHt 
(il) If anc*jecri*notm©»eofthe^»^ 

(i2> If an object i» in the i n terse ctio n, then it must be itf att the inputs. 
(iS) If an object is hot in the intersection and is in afHwiFone of the inputs, then it can not 
be in the remaining input. 

(i4) If an object is iaaflof the inputs, men it must be in the intersection. 
(i5) If an input contains exactty oneobjeot, and if 'that object is contained in alt the other 
inputs, then the interjection muct contain exactly that object; 
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3.L3 Complement 

The complement of a class is defined as everything in the universe which is not 
in the class. Thus the way to represent it using a pajtUioo constraint is shown in figure 
3-5a -- If T* is "everything in the universe,." .rjfeken, *B' is, the complement of 'A* (and 
conversely). This behaves property: If Vis to i'A', <h«n it cannot be in, 'p by rule (pi); if 
V is not in 'A' then it must be in 'B' byi-rule (pS), »U»©»**' £1$ T^ 

To constrain T properly, it is necessary to state that every class is a subclass of 
T\ This can be done (rather wastef ully) by having an exf*Ucit subclass constraint between 
each class and TV It can be done lesi wastef ully by having an explicit subclass constraint 
from the tops of all taxonomies (such as PHYSICAL-OBJECTS m figure H) to T <~ all 
the other classeain the taxonomy are then- tmpj^irty lubclaises, of T'. However, the real 
problem with defining T* is thai for a given data-base if, might be dif f icult to decide 
whether or not aH classct have indeed been constraioed to be(explictt or impficit) subclasses 
of T'. In view of this, it seems preferable to make wmplemem a n^priwtivc irwtead of 
a macro, Figure 3-5b gives the network symbol for coropterawK; and figure 3-5c show* the 
two obvious propagation rules: 

(d) If an object is in one of the complementary classes then it can not b« in the other. 
(c2) If an object is not in one of the complementary classes then it must be in the other. 

Note that given intersection and complement it M possible to define all of the 
other Boolean functions. Thus (as promised in waion 2.1) it has been demonstrated that all 
Boolean function* can be represented in twnwqf the pariitteo constraint. 

Figure 3-5d shows an example which, uses all three <of Interiection, union, and 
complement. The network to the right of the dotted Uo« represent* th« facts that fortunate- 
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ones are either wiie-onei or kicky-ones (or both), that unfortunite-ones are all those that are 
not fortunate-ones, and that all unfortunate-ones are unhappy-ones. Suppose that these 
facts already exist in the data-base, and that a user wishes to know if everyone who is both 
unlucky and unwise is necessarily unhappy Wjp1ir^ii^f^)Mlii^^''bifr constructs 
the network fragment to the left of the dotted tine: *Al' ft the daw of tthwise-ones, 'AY is 
the class of unlucky-ones, and *A3* is the intersection of the two (i.e. those that are both 
unwise and unlucky); To show that all those tf 'AS**!* necessarily unhappy (i.e. that 'A3* is 
a subclass of Ur^APP^ONES), the .'fofay iabelfng of **x* on 'AS^and* "-x* on 
UNHAPPY-ONES it 'usett' (as diseased in section t*%. •' '•* thu produces in inconsistency, 
then the user knows that ail unwise and unlucky one* are indeed Unhappy. " 

Ffoitf the teitial^ ^ %?. From 

these, rutefcl) propagate -X to WISE-ONES and 40CkY-$N!^T^ 
-x on FORTUNATZ-ONES From thU, ral«(c2>|m)ducei>3t on UNFORTUNATE- 
ONES, which ruteipij finally propagates as ««i)MtlHMm>i1NlMt5^i^s» collides with 
the initial -x on UNHAPW-ONSS, indicating an hwonttstencf. Note that this fctference 
in effect proves one of DeMorgan's laws (specifically, i^'#<^)&«flW$jk which are not 
so easy to prove innrwst other logkal systemj. -■<.,■„.;■ .--^yv 

3.2 Using the ObjettGeitstraiitt: Bfistfnet Ob jeets 

One issue that was not reaBy resolved in section fc2 is that of distinct objects. In 
CE. there is no easy "syntactic" check for object equality (such u comparing print-names) - 
deciding whether or not two object* are equal can involve an arbitrarily complex inference 
However, in most cases it it positble to structure the data-btie so th»t the object-equality 
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inferences are very simple. 

The basic idea is to use a taxonomic hierarchy such as fjgure 3-1 tg i structure the 
objects known to the da^-bas*. That U, ever >l known dj*Uf>ct objea wijj occur at the end 
of some branch of the taxonomy tree. For exan^, SEQUOIA-NAT'L-PARK'S- 
REDWOODS might be par$poned in» c^j^b« classes which represent, the known d««»n« 
redwoods. Now^ these, redwood* are known to be cUstwct among themselves because they 
are all subclasses of th« same partition, (and hence arj .nj^lljf^clMsJve). Furtheenwre, 
each redwood is known to be distinct from all the f s^-$£y^^£sqa»»*t somje teyej in 
the tree there is a partition which puts REDWOODS and -fft£<0ft** W»»* in g tne non- 
redwood object into distinct subclasies. For exarnj^, a glyeii redWsOod and a given 
bacterium are known jo be distinct became of the partUwiing of ANtt^ATi>p;BJECTS. 
In general, every objfct<al the end of a b^nch^th^^oiiora| trjse> is known to be 
distinct from evejy other such object becauiej at some place above them is a partition 
constraint which puts them into mutually exclusiye classes. Thu* when adding a, new 
Sequoia National Par* redwood (for example) to the daa-pase. >t i* only necessary 40 rnake 
it explicitly distinct frpm the other known Sequoia Natiojia|«Park ledwoods T the hierarchy 
of exclusive partitions above it will insure that it is distinct from every other object in the 
taxonomy. 
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3.3 Using the Binary Relationship Constraint 
3.11 Transitive Rotations 

If. for example, -a dachas* xontains th* information than Aristotle is taller then 
Plato and that Plato irtiMter t^-Soalfe*, xh^n it should be may to infer that Aristotle is 
taller that* Socrates. T**Js is easy to do if *«fryofi£s height uawoWn^ihetnairty (eg. Plato's 
height is 70 ihc*es\;bi!t it is not so easy wtien s^*coi«p^'«f^^ 
This section shows 1iow traikitiv*^ 
within the CE represwitattoii even in the absence lit complete n^efeicai ir^ormaUon. 

The key to doing ifrfc h'thi^'eWiatoi^^Ji^li-ht^in^t not be constrained 
in terms of a metrical total %iderihg; it U constrained in terms of a partial ordering with 
respect to A^is»tle> h«g4t j^Sdttates* te 

"stacking* of the btrwry t^ttoriship constraints nadir kkiJ^-giflra convenient 
notation foravoidihg tmss1itg*m^:mtheT*«^ alt three 

constrain»ha*e1I^GHT-«^ as their 

are meant to present TShle^par^l otd»tt% --'the 'ima^^^k^M&uk'to concerned 
with exactly liow isuch partial or*BrtiigS can be represented 'y%bi^ f ©f ^ 'listing 'CE 
primitives. *' v ■ ' .-•■■u:-.- n^tun,^- ■■: '■ '*'■■ 

Since CE already handles the transitive relations "subclass of" and "superclass of" 
in an efficient manner, the -obvious thing to do is to represent "height greater than" in 
terms of these. That sis, Aristotle's height will be known to be greater than Plato's height 
because some class associated with the former will be a superclass of (i je. greater than) some 
class associated with the latter. Since "superclass of" U reflexive <i.e. 'A' is a superclass of 
'A') and "height greater than" is not {Plato's height is not greater than Plato's height), this 
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section first describes how to represent the reflexive parrtal ordering "height as great as" 

(i.e. "height equal or greater than"). Then a technique for representing "height strictly 

greater than" it described. 

To represent "height as great as," a new binary relation HEIGHT-METRIC is 
needed. This maps from an individual height («.g. PlatoVheight) to the appropriate 
"metric height" The metric height is a class such that one individual height it as great as 
another individual height if and only if the former's metric height U a tuperchus of the 
tatter's. All this is diagrammed in figure 3-6b ? CAMH*. "PntHVind 'SMH' are the, metric 
heights for Aristotle, Plato, and Socrates, respectively.) Now, showing that Aristotle's height 
is as great as Socrates' reduces to showing that SMH i$ a subclass of AMR which is easy 
(Just start a *x at SMH and apply rule (pi} twice). Of course, a w«>JWeugned user interlace 
to a CE data-base would contain "macros" to expand constructs $ucb as "A is as tall as B 
into the appropriate GE network - the user need not be aware of the low-level details 
involving metric heights. 

The use of metric heights as in 3-6b does indeed behave properly, but it is also 

!■■■■■' 

helpful to have some intuitive interpretation of what a class such ai PMH "really" contains. 
One such interpretation is to consider PMH to contain as objects the integers from to 
some N. This number N then can represent the indiv«du*4, height in some arbitrary units 
(such as inches). Under this interpretation, PMH would be die ehm of integers 0,1,...,69,70 
(since Plato's height is 70 inches). Note that the data-base neid never jrefer to these 
number-objects explicitly ~ figure 3-6b uses the partial ordering among the classes AM H, 
PMH, and SMH without ever referring to the objects contained in them. It is clear that 
this interpretation for metric height classes satisfies the requirement that one such class be a 
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superclass of another iff the former represents^ greater (or eoual) individual height -one 
sequence O.U.N is a sup«rCl»$s of another one 0.K....M iff MsN. 

Given this technique for representing "as great ast^Ki* easy to extend it to 
represent "strictly great* .thanf AH that ^t^0mm^W^^^lSpaBik- of "superclass." 
For example, AMH is a mm superclass Jo? Vi^ltf AM1frWKm0tfc& of PUfi and 
PMH is not a sdperetesi Of A*fH. That ij, iff AMH a f superclass of PMH and there is 
some object which is In AMH but not PMH. Figure $6d foptiserlts'triisf A&H is still a 
superclass of PMH, and AMH is known to contain at feist oh*t^jett fhl*) which is not 
contained hi PMH. Simflafly; PMH is a strict superctau of SMH. 

Now, even though it is easy enough to state the fact that dne class is a strict 
superclass of another, w i#«not so easy «r infer sulh a facC' Soerri ^strict superclass" 
inference (involving AMH ami PMH, for example) mutt m^lmmmm'm^-^the first 
step is rodewrn^elhat AMH i»-mde«rast*0er^ Via the 

usual procedure <i.e. by starting « *xMnf from i>-Pim.'ii)tf:«'-H4rif^i^4ililt>)u<d then 
watching f or »iv inconslwtncy). The second «*p is to g*d *** obj**' which is in AMH 
and is not in PMH - uicfe "fitKt'inf4wenctsar«d(s«H«^ ^apoewi*^^ 

It i^^ 
in terms of the uiidertyiiig semantics, and not ^ termf^f 4Mi?jtili^0n#sie&-^or example, 
the user interface to a GJE d*ta***te might allow statements such as 
"talto-than<Afistt|tle#^o>" l r»with^e;imer#a€e expanding fhiuuttement into the network of 
figure S"6b. The use of. such "macros' is also dfecussed in section M% ; - ..the important 
point «**hat this document is concerned with demonstrating CEVf unetional capabilities, not 
with specifying a partioiiar syntax for the user inwif ace. 
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3.3.2 N-ary Relations 

Codd's Relational Data-base scheme [Codd 1970] uses N-ary relations as its 
primitive construct Section §,2.1 in par* two compares ^ Codjd> syfttem ^b CE in more 
deuil - this section brief ly describes h$w N-ary relation* can be represented in CE in terms 
of binary relations. An example N-ary jelajtiee is. the following for persons: A person lias 
a name, a mother, a father, a sex, and a blood-group. Thus the relation tuple for Jane 
Smith might be Qane Smith\ "Maj-y Smith". "Joe.$fnith", "f emaltV "ab") why* the 
double-quotes indicate that the items in the tuple are chawcter-strings. 

One common way of representing N-ary relationf in tflrms of binary ones is to use 
a separate binary relation for each "slot" of the N-tupte. Figure 3-7a uses this technique to 
represent the above N-ary relation. In this case, the binary relations used for. -the 
individual slots are NA^E-OF, ^ME-QF?MOTHER-QF, NAME-pF-FATHER-OF, 
NAME-QF-SEX-QF, and NAME<>F-BLQQrHJROyj>-OF, Aiaboite, the double-quotes 
in 3-7a indicate character-strings: For examplMhe NAME-OF-MOTHER-OF Jane Smith 
is an object which is the xharactfr string "Marv Smith". 

Since CE cat* represent objects (eg. people) directly (and not just in terms of their 
names), it seems preferable to use a binary relation MOTHER-OF (which relates a person 
to the object which is their mother) instead ofjJ^AM^-Q^fOJHE&OF. Figure 3-7b 
does this for all the binary relations in 3-7a except for NAM^& w>MeM*syil a character 
string. 
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3.4 Using the Inverse Constraint 

3.4.1 Total Relations 

Section 2.4 shows how »the inverse constraint can be used with the binary 
relationship constraint" arsbtfc that every object ifr tfcVdbiliiir* ©i ^ relationship is indeed 
related to sorne object in the range. Fot example, figure 2*9a sates that every person has a 
sex. Tlte construct uisd it* 2^^ Figure 

3-8a shows thi* symbol for a "Iota* binary refctiomhip* and 3-Sb ihowi what the symbol 
means. As in section Si, this symbol can be etfher a inacrd" fir 3*b, or it can be a new 

primitive (if implementation drcurnstances warrant it). 

, i ■ 

3.4.2 Quantification 

Figure 2-» shows how the class of GfcAr*DPARtr#S catt be defined by using 
the typical-member e&rwtrarrt As can be seen, this te a rather complicated definition, and 
the t-m constraint itself seem* mere comptoc than the other conttralntt discosted in this 
document. In view of these comjsftwitief, ft is fort u na te that tmmy rnstances of such 
quantification can be accomplished simply in *erm$ of bhwof reftrttomrhip constraints and 
inverse constraint. 

For example, figure 3*c giver a much simpler definition of GHAN&PAfcENTS 
m terms of PERSONS and - CtflflMtttK)?: A grandparent It an object which has a 
child which has a cMM which is a person. Of course, the class *B.f represents the relation 
PARENTS-OF (the inverse of CHILDREN-OF), so another prose translation of figure 
3-8c is that grandparents are the parents of parents of persons, in any case, 3-8c is certainly 
much simpler than 2-19. Section 5.3.6 in part two discusses more about CE's quantification 
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mechanisms and how they relate to the corrt^onding m ccbat t i HW of other representations. 

3.5 Using the World Consja-aint ' 

3.5.1 Hierarchical Contexts 

Section 2$ mentioned, that worlds can be used to represent a particular "point of 
view" of some user of the data-base. For example, |e| W-jy^E|S^CITH M a world-class 
which represents all the worlds, which afe consistent with what Jaoe Smith (a user) believes. 
Then figure ?-9a represeptjtbe inforjn|ti^„that Jane jtajth JbfMvef ?b« aU draft evaders 
are criminals. That is, Jane Smith's "view" of the data-bast includes this information. 
This becomes relevant when Jane Smith initiates some inference involving thejjlass 

CRIMINALS -- from her point of view tha| cjasj include*, draft evader*, while from some 

i .■ *' ' ' ........... 

other user's p^int of view jt m^ht no|. 

When perfprming such an inference, »t is necessary for the information attached 
to W-jANJE-SMITH ® be somehow "enab|^" f or, the,p^rjitio§p/fh« infe/ence |n a 
manner whi$h does not conflict with other user's pojpu-of -view. To dp this, a nnf label 
is started from W^ANE-SMITH ajpart pf th? initial labelipg.. This *inf can then 
propagate to classes such as *W ^3', which enable* the appropriate information. This is 
indeed the desired behavior. As for what^te ^^lt^L"n|||^.|Hinin4,-^nf t in th« class 
W-JANE-SMITH meanAthat *jnf 'J* one of ttewo^d* consjjsMn^jWith Jane Smith's be^efs. 
That is, what Jane Smith believes 4s considered to be true for -the duration of. current 
inference. 

Now, classes such as W-JANffSMITJi can often be structured within a 
hierarchy. For instance, Jaw South rrught be working, on a project w«hin some group in 
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some agency of sen* department of an organization. So Jane Smith has her own private 
view of the data-base^ which is a specialization of the view taken by the project as a whole, 
which in turn is a specialization of the view taken byiiff 'filtip & a whole; etc. Such a 
hierarchy is shown in figure $-9b. Note that there may be &^ i W^iMW^' hierarchy 
(forming a tree) -- the project might have tiers in : additiotfii Jifie *tib have private 
views, the group might havener projects, a^ 

The hierarchy in figure 3-9b is meant to parallel We formal organisation chart 
for Jane's department: ^espohs&ll^ for changing ' , tl^'(^ 3 9^li^' i yrlb^' : fan>e1s'jearf be 
given to administfatdrs at those teveE This Way, pertfey decision* made at the agency level 
(for example) can be reflected a* infonnatiOT in the agency-lev^ 

becomes part of every view betow it in the hierarchy. In parttebftir, any lower-level view 
which is inconsistent with the (updated) agency-Ievd view Will cwise seemingly spurious 
inconsistencies to appear during infensnees uiing that fcjwer ffl&j. f W l«e r prbcess of tracing 
this down (debugging the 'data; Wit wereX this ^M^^^miM lowerMevel and 
agency-level view wilt h^t^tetaeMfc Having l ;iiffldP^%&fcdiikitf t Wtthift the 
organization, it l* necessary to eWttr refWmutatr«he ag«ncy*fe»tf poWcy so $* Wirj&feoftf lict 
with lower tevefc; wto re*fcf^ with 

the view impelled fnom ! *Sbve: %te point of thi ~&tffa&W ^^iriisil^ »n integrated 
data-base withm^anargftt^abh mighf>¥oviie%p©we#tt^ to 

insure that the organisation's hig^eHevel goals ^ is being 

done at lower levels. 

Ariother tises for taei^bie* stich » Since the 

only way Jane SnttthVprivate'tiewlf^ be put 
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on the class-point W-JAN&-SMITH, it folbwt that wnwon* without acc«*» to the class- 
point is also without access to Jane Smith's view. Thu* by cpnt^Mng access to a limited set 
of class-points it is possible to confrol access to ajl the information attached to them. In this 
way, the security and privacy structure of the data-base sao be made quite independent of 
the particular pieces of data to be stored within that structure. 

3.5.2 Naive Probability 

This section shows how a world-class hierarchy can be used to handle qualified 
information, such as "Itto likely that Jane SroUh'i mother it M**y Smith," and "Most birds 
can fly." The basic idea is to set upjw ordering such as figure 9-9c. Th* categories such 
as "very likely" represent a division of the probability continuum into discrete pieces. No 
attempt will be made here to assign exact numerical values to these categories, and indeed 
they will not be used in a manner which supports standard probabilistic calculations. For 
example, it turns put that the probability in this scheme of "P and <QT is the minimum of 
the two individual probabilities, instead of being their product (under *he assumption that 
P and Qare independent). 

In any case, it is necessary to explain exactly what the classes in 3-9c mean and 
how they are to influence the behavior of the inference process. Like other world-classes, 
the ones in 3-9c are used to enable various pieces of information. In thi*«ase, a class 
presumably enables information of the appropriate probabiH^ For example, W-UK ELY 
might enable "Jane Smith'* mother is Mary Smith. 1 " which would mean that it is likely that 
Jane Smith's mother is Mary Smith. Now, the classes in 3^e represent! the different levels 
of "reliability" which a user may demand of th* data^basei That is^ ifi the user wants 
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answers which are 'certain* then only the information tnabled by the chits W-CERTAIN 
may be used in deriving such answers If *e user % wrthn^ to accept "almost certain" 
answers, then the information -enabled by W-ALMOST-CERTAfr* can be used in addition 
to the W^ERTAIR htformition. In general, if throser demands a particular kivel of 
reliability, then only information which is at lea^tr^fcelfiite may be used during the 
inference process. 

Thus the general strategy is for the user to pick a reliability level (such as 
W-ALMOST-CERTA1N) and start a «*nf tabet ftomihit Class; this '♦snf will then 
enable all of the information attached to flkt tetef^and wHJ also propagate to all higher 
levelrfouch as W-CERTAB*), enabling this information toa Inadehfciffy, this shows why 
the probability of "P and <£ is the mirtimum of the individual probabilities: **P and <JT is 
inferable when both P and <£ are enabled, which occurs when : -ti& 'least probable one is 
enabled.' /■■• : i-, ■'■'-'■•": ■"•;■■■'■''■■ ■■■■ '"'■■ ■--;-- *- v,; ' 

NOw, some users might not want to be required to : afc*tfyftfttifeHtejr level before 
initiating an inference - instead, they might like the data-base to make rhfr inference and 
then tell them after it is done what iu reliability level is. T^ ianibe accoiwpl!shed in 
several ways. The most bru*e*orc« ft ^ ^ level of 

"certain." If that producer no aniwer, then the next lower rHvtbHity can be tried. In 
general, the inferemt can be repeated using successively towel* retia%ffity levels until it 
produces an answer at some level (assuming that ft produces anahswef at all). Then this 
answer has the reliability of that level, which is the htghtit appttcable one since the levels 
were tried sequentially, in deffeastog brder. 

As with the t-m constraint, soch sequential proceuiflg can be eliminated by using 
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more than one world at a time. In this caw, the Idea is & start a <4^fereru) workj-oteject 
label at ej^ relUbiUty level. Then the one with highest ceUaMuy which produces an 
answer represents the reliability of that answer. A good way to implement such a multiple- 
label scheme is to use the world-tree mechanism introduced m ; sejetiw? £6. In this case, the 
world-objects started from the classes W-£pTAJ*i W-A^IQ^T-CpTAIN, etc.. will be 
called 'w-certain', V-almost-certain', etc Furthermore, these objects will be arranged in the 
world-tree in the manner of figure 3-9d. The great advantage of doing this (instead of 
letting the different world-objects act completely independently) is that the different worlds 
can interact: If for example a label with a worW-ug of Valmoit-qtrtaln' interacts with a 
label with a tag of *w-likely' at some constraint, then |he resulting labels will propagate with 
tags of Sv-likely' (the "stronger" world -see 26 for a fuller discussion of such interactions). 
Furthermore, the resulting reliability of the entire inference can be seen immediately by 
looking at the world-tag of the labels involved in the collision which ends the inference - 
the world-tag at the collision will be the reliability-level of the answer. Now, the details of 
the foregoing are not too important - the significant aspect of it is that sequential 
processing ("time") can be rather easily traded for multiple worlds ("space"). 

Note that the probability mechanism of this, section can interact with the context 
mechanism of the previous one. For example, Jane Smith's data-base view might include 
"it is almost certain that draft evaders ,ju*e crimigals," Figure 3-9$ diagrams this - Jane 
Smith's view enables the attachment of "all draft-evaders are criminals" to the "almost 
certain" reliability level. 

Also note that information which is not explicitly enabled by any world-class (via 
some world constraint) is always "enabled" (because it is not relativized). This means that 
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information which is not attached to a reliability world*ctetti is assumed to be maximally 
reliable, and that information not attached to aNte»* world-class is part ofHili views. 

'3.5.3 Relativiiing <^her ConarMMftt ..-i^Hii:.)..:.^; 

Since the wofii constraint i* a relativized version of the subclass constraint, it 
seems reasonable to ask why relativized versions of the other primitives constraints have 
not been presented. The reason U that the relativized subclass constraint can be used to 
"insulate" any of the other primitives such that the primitive is accessible only in the 
specified worlds. For example; figure s 3-9f shows an unrelat^Ued^ binary relationship 
constraint (which was copied from figure 2-4e). Rgure^§«$g reUtiviies "iTto the world class 
•W": The twin worla* constraints serve to disconnect 1 * one side of W except for worlds in 
*W*. The same technique can be used with all the other primitive*. 

There is, however, one real difference between the world constraint and 
relativized versions of other constraint! The world constraint can process *-w" labels on its 
world-class by propagating %bels which ieny that the sUbtiass ? conaSalnt holds in world V. 
For more complex constants, however, it ''ii'^pne^'pa^Oe io'tM t pattern of 
labels which represents the denial of the constraint, Tnte is Sraus^th^more complex 
constraints involve a conjunction of conditions - If the whole Hi'ing is to be denied, it is not 
known which of "how i : mahy" : if the* M'j^^ih^^im^^m'^t^'it no real 
problem - most application* 'of 'n^fted constraint "mvolve ^'labe^tforlhe purpose of 
"enabling" the constraint in the specified worlds) and not "-" ones!" For example, "-"labels 
play no necessary part in &# i£iftiffiid'd&M&^^ " M 
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3.6 Using the T-M Constraint: Templates ., 

Figure 3-7 shows how an individual Codd-jike. ^-tu|)« can be, represented by N 
binary relationships, each one specifying one ;.."^ibute" of ^^ldiyidual (i-e. on* slot in 
the tuple). Now, it is not meaningful Ju« to asilgn any ajbijirary ohject as the value of 
some attribute - presumably the MOTHE^F J^,|0^,mu^ J^^| fernale pe«on, the 
SEX-OF must one of the SEXES 'mate' or *female , , etc, Thm^onsjr^ntsycan be specif ied 
for persons in general as shown in t\^p^%^J((^^^jpiff^^ network in 3-10a 
have appeared before as isolated fragments in previous diagrarn» - figure 3-10a begins to 
hint at the kind of rich interconnection which occurs in any non-trivial CE data-base. 

Figure 3-10a is an example of a "template": It uses a t-m constraint to describe the 
typical person (or whatever) in terms of its attributes. Note that only one t-m constraint is 
needed in describing all the attributes. This is fortunate because t-m constraints are rather 
complex computationally and it would be burdensome to have to go through the 
computations separately for each attribute. Of course in a real data-base a person would 
have many more attributes, but one t-m constraint would still suffice for all of them. 

As shown in 3-10a, the template only specifies the functionality (e.g. one-to-one) 
and range (e.g. female persons) of the various attributes. This could have been done 
(rather wastefully) by using a separate t-m constraint for each attribute, such as figure 2-13c 
does for MOTHER-OF. However, using only one t-m constraint as in 3-IOa has the 
further advantage that it allows the specification of constraints between the attribute 
values. For example, presumably the-mother and the-father are married (ignoring 
complications such a? illegitimacy and divorce), and furthermore they are married only to 
each other (ignoring polygamy). Figure 3-10b shows a network fragment which expresses 
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these constraints - the class-points ^he-mother* and "the-father' in 3-10b are meant to be the 
same ones as in 3-KJa. Figure 3-I0b also includes the general information that 
HUSBAND-OF and WIFE-OF are inverses, and defines SPObSE-OF in terms of them. 

In summary, a template for a claw provide* a general Structure of constraints 
which must be ^km^'^^^J'^ ob>ct wSclasTVhese constraints can 
involve either one attrfbuttOr more than one. Structurally, a template consists of its t-m 
constraint, its relevant *ft of fcftL & \tot $ i and a net^l6ra4trainu which must 
hold among the values of these attributes. 
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Part Two - Epistemoloyv. etc. 

Having examined the detailed structure of the Jfees(ty part one), it ii now time to 
look at the forest Section 4 briefly discusses why tepreientation iisue* are imporunt in the 
first place, and.dividet the CE representation into four mpre*r-leu independent "layers." 
Section 5 then discusses some of the representttioittl issue* appropriate to each layer. Ifsing 
these issues, section 6 compares CE with other data-base and artificial-intelligence 
representations. 

4 Generalities 

4.1 Why are representational issues important? 

The representations used in an information-processing system fundamentally 
affect the kinds of structures which an be built and the kinds of processes which can 
operate on these structures. This is true even if two represenutions are in some sense 
equivalent - the differences in their basic structures will still affect their macroscopic 
behavior. For example, consider the difference between Roman and Arabic numerals. 
They are formally equivalent in that either can be used to represent any positive integer. 
However, it is extremely difficult to do long division (for instance) using the Roman 
representation, much more difficult than if the Arabic representation is used. In this case, 
it is relatively safe to say that Arabic numerals are a "better" representation than Roman 
numerals — the procedures for manipulating the Arabic ones are easier for humans to 
perform. 

If a difference of representation can have such a great effect within the simple 
domain of arithmetic, it is not hard to imagine the correspondingly greater effects such a 



64 

difference can have in more complex domain*. One such domain is that of computer 
algorithms, and this domain has literally hundredi of ttfftfent representations (i.e. the 
various computer languages), each of which has some partisans who acclaim it to be the 
"best." It is not th* put pose of this document *# engage in such a language debate." 
However, it often is instructtve to compare a new representation wtth existing ones in order 
get a better grasp of its strengths and weaknesses, ft ii with this in mind that section 6 
compares CE with other representations. 

4.2 Why is the CE representation interesting? 

This document makes three claims for the CE representations. The first is that it 
has a great deal of oxprossivo power - it is sufficiently rich to be able to 
represent a wide range of mformatton. The second claim is that CE has a firm formal 
semantic** - thtt allows precise statement* to be made about the meaning and behavior 
of the various CE expressions. The nature of this semantics is discussed in section 5. The 
third claim is that CE has a high degree of modularity fin several senses of the 
word) - this allows various "parts* of the representation to be discussed without worrying 
too much about how they wiff eventually fit together into "whoWV Also, this modularity is 
largely responsible for CE's ability to make use of parallelism (discussed below). 

4.3 Modularity - Uyered Decomposition 

The fact of CE's modularity permeates this entire document: Section 1 discusses 
label-propagation without regard to ho* it will be constrained] section 2 discusses the 
primitive constraints without regard to the macro-structures whkh wiBbe buift out of them; 
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and then section 3 discusses these macro-structures. Using this decomposition, it is possible 
to discuss CE in terms of four separate j^u^n)l ; "j|ye|i." TJ^y ate; 

(1) The idea of label propagation in general; 

(2) The particular labels described in section 1.4.1; 

(3) The particular primitive constraints described in tejrrns of theif labejf; . 

(4) The particular macro-constraints made out of these primitives. 

Section 5 discusses the different representational issues which are appropriate to 
each of these layers. Having a layered decompositioji such as this makes a complex system 
much easier to understand: When studying constructs a.t one layer, the constructs of the 
previous layer can be considered to be "atomic" and their fine-structure can be ignored. 
Indeed, Simon [1969] proposes that humans can understand complex systems only when they 
are layered in such a manner. , 

Another advantage of a layered system is that the upper layers can be modified 
or thrown away without disturbing the tower ones. S|nce creating each layer of the CE 
representation involved its own design decisions {some rather f rbjtrary), it .^ guite pjwible 
that someone else might want to create a similar system incorporating different design 
decisions. In doing this, the layers below the one to be changed can be carried over intact 
- it is not necessary to re-excavate the foundation in order to repaint the roof. For the 
benefit of those who might be interested in making tuch changes, appendix D discusses 
alternatives to some of the design decisions wh|ch .a/e^emj^ied^ in this, document. 
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5 Some Representational Issues 

This section presents some general reprwentational issues and discusses CE's 
position on them. Section 6 below below dttcuwei other representations* positions on these 
same issues. Section 5 often refers to "other representations" in general without giving any 
details - these details cant* found in selrt^B:^^^ ^^ ^ ° ; ^ 

5.1 Issues relating to label-propagation in general 

5J.1 Modularity - Relatmg1^"lj'^tebar '' ! 

Section 4.3 discusses one kind of modularity - the ability to decompose a complex 
system into more-or-less independent layers. This section considers a different kind of 
modularity, which allows complex information to be represented in terms of more-or-less 
independent chunks of "local" information. Now, all of the representations discussed below 
have this kind of modularity to some extent: They all represent their information (however 
complex) in tetms of ixkntiiti of locally-meaningful ■primitives." However, the 
representations differ with respect to how a given chunk of local information interacts with 
another. It Is this inteftcHon whlcn attows more "global" information to 1* built up out of 
the chunks of "local" information. * ' «.. 

In CE. label propagation provides the one and only chanrter for such interaction. 
This is an fcxtremety Simple channel - most other representations use far more complex 
ones. The simplicity of the channel makes it easy to describe the global meaning of a CE 
expression in terms of the local meanings of the primitive constraints - one need not worry 
about possible complex interactions, since there are none. In general, minimizing gratuitous 
interaction greatly simplifies the task of analyzing and/or synthesizing complex ("global") 
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expressions in terms of simpler ("local") ones. Many of the benefits of "structured 
programming," for example, are due to exactly this kind of modularity. 

5.1.2 Logical Semantics 

The use of a simple interaction channel also allows the precise specification of 
what a representation's primitives mean in the first place. That is, since all interactions 
occur through a well-understood channel, the onjy, effect a primitive (or other) expression 
can have is in terms of how it transmits and receives on that channel. Thus the complete 
meaning and behavior of a primitive can be specified in terms of its input-output 
interactions with the channel. On the other hand, if a representation uses a complex or ill- 
defined channel, then it is usually impossible to completely specify a primitive's input- 
output interactions. That is, it is not well-defined what the representation's primitives mean 
locally and how they interact. 

Woods [1975] and Hayes Q974] both protest the fact that many representations do 
not have a precise semantics. This is not to say that representations without a logical 
semantics are "meaningless" -- they may work very welt indeed on certain kinds of 
examples. However, it is usually impossible to infer from the author's examples anything 
about how eVen minor changes in some detail will affect the global structure. Especially, it 
is difficult to see how far the representation can be extended to cover new examples. 
Another problem caused by the lack of a logical semantic* is that it makes it very difficult 
to compare two representations - if it is unclear what one (or both) really mean by some 
construct, then precise comparison of the two is impossible. 

Yet another problem is that lack of a formal semantics can encourage sloppy 
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thinking. For instance, a primitive without a precise definition may end up being used in 
examples where its name seems appropriate, even though no real mechanism is given for 
handling those examples. Woods D975] and McDermott 09751 point out many cases of such 
"wishful mnemonics" and other kinds of sloppy formulation. To be sure, having a formal 
semantics does not make one immune from error, and for some representations the required 
formalization would take more effort than it would be worth However, having a logical 
semantics is definitely an asset when trying lb use, study, or extend a representation. 

5.1.3 Procedural Semantics 

A logical semantics tells what a given expression "means" a procedural semantics 
tells how the expression •behaves." Presumably, a representation exists for the purpose of 
its being used, so one is ultimately interested in how it behaves, Ift a representation such as 
CE which is based on label propagations, the logical semantics and the procedural 
semantics are tightly coupled: An expression's meaning is defined in terms of how it 
interacts with various kinds of labels, and its behavior is determined by these interactions. 

One advantage of having such a tight coupling is thit it makes a representation 
easier to use. Instead of having to keep in mind both the meaning of the "data" (expressed 
in terms of the representation) and the behavior of some external procedure which accesses 
it (presumably expressed in some programming language), the data itself specifies both its 
meaning and the behavior of the accessing procedure. Of course the data might not be 
directly executable by some given piece of hardware, in which case a simulator is needed. 
Appendix B discusses three different possible implementations of ' CE -- one made of 
parallel hardware which executes the primitives directly, and two which simulate this. 
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Another advantage of the tight coupling U that it makes a system easier to debug. 
Since the accessing procedure operates in a manner which parallels the semantics of the 
data, the state of a procedure (in cafe of a crash, for instance) will be in semantically 
meaningful terms. In the caie of CE, the inferenct procedure'* «ate can be defined as the 
states of all the class-points (in terms of what labeU are on |hj^)^ J"huf if there is a crash, 
the inference procedure's state will be easy to express in a semantically meaningful manner 
(e.g. "the object V is known to be a person"), and things can be debugged on this level. 
Furthermore, it is the case that labels are never erased during an inference - new ^ones may 
be put on a class-point, but none may be removed. Thus. not only the xurrent state but the 
com p let e h if tory of previous states is available during debugging. Note that "error 
analysis" can be viewed as a high-level form of such debugging, wherein the "bug" is. some 
kind of inconsistency caused by bad data. In this case, the history of the relevant inference 
can be used to determine the path of dau-accesseJ that kd to. the inconsistency, which is 
necessary (although certainly not sufficiwO for determining exactly where along that path 
the erroneous datum lies. 
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5.2 Issues relating to CE*s particular labels 

The remarks in section 5.1 apply to any system which operates via label 
propagation. The remarks in this section apply Only to systems which use labels similar to 
the "Kibj*, "-Ob j*. and "««dbf labels of *€£.' Apperwiix D discusses a simifcr labeling scheme 
using slightly different labels - most Of the remarks In this section apply to that scheme 
also. ' : 

52.1 Fregean Systems 

CE (and all the other representations discussed in this document) are "Fregean* in 
that their universe consists of discrete objects and relationships ainc^g objects. In CE's 
case, this is a consequence of the fact that its labels are defined at relating discrete objects 
to classes of objects. The reason for mentioning all this here is that there seem to be 
certain limitation! on what Fregean systems ctli represent. Hayes 0974] discusses this in 
more detail. One of his examples is "substance* A substance (such as water) is usually not 
thought of as being composed of discrete objects. A pail (or drop, or ocean) of water seems 
to be a single distinguishable object, but what of the water itself ? ft is the existence of such 
issues which indicate that the final answers to the representation problem are by no means 
available (especially for sophisticated Artificial Intelligence applications). However, the 
state of the art is such that it does seem profitable to apply AI technology (such as CE) to 
the problems of data-bases. 
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5.22 Consistency Checking 

GE's labels are designed |o facilitate consisfcncy checking -? an inconsistency is 
detected iff a Aobf label collides with a %obj" <or %^bj"),»l jome<tass-point. Section 1.4 
shows that redunda^y checkJfcg and the answering of yes/no <p««stiqiis ean be subsumed 
under consistency checking; appendix A shows how this can be extended to "find" 
questions. An important raakwprld data-base application of such constotency checking is to 
validate incoming data in terms of what is already known. If an incomjstency Is detected, 
then something is wrong and the appropriate actions should Joe taken (such as rejecting the 
bad data, rejecting it and Jogging,* recqrd of th^ iflconiUtency, a*king a human to correct 
it, or attempting some for#», of automatic error ^^ a^lysU andjcofl^ion). .., 

5.2.3 Additions / Deletions I Updates 

Using CE, checking the consistency of new data, w&h 9fipff*4» the existing data- 
base is the way to validate it before adding H to the data-base Some representations which 
clo not have a awsisteocy^hjecking procedure d^howextr, bay? other means for .handling 
the addition of new data. Simi^rly, some ^epr<serjaUc«S|ve^id« special means for 
handling deletions of existing data. An update, of course, can, be considered to be a 
deletion followed by an adejition, so those reprewnutiora which handle additions and 
deletions can also handle updates. In addition, some xep resfn taJ U ons have a separate (more 
efficient) means for update processing. 

CE does not provide any additional mechanisms for handling deletions (and 
hence updates). In general, deleting a piece of network from a CE data-base can not 
possibly cause the remaining data to become inconsistent: Inconsistency is a state resulting 
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when "too much" is known (in particular, when both an auertton »nd its negation occur), so 
deleting some data from ant exiting data-base (whfch, amttropomorphieaHy, makes the 
data-base know less) is * safe operation. Notrthat deteinj the datum V means that the 
data-base after** diletfow doet rto* lin^^ data-base 

knows "not a." i 

The orw problem which can ahsedurini; t deletion it thWsonW data might be 
stored redundantly. For example, the d«t«Nns* might contain both "finteSmith is in the 
class FEMftEES" and the SIX-OP Jan*»ShHrh ii^emite? fifertg with the general 
information that FEWM.E8 tr exactly tJwclairof objects which *ha*a sex* of Temale"). In 
this case, deleting "Jlne Smith is in the ck« ^PlMMlS^wn^^cauitf^rne data-base to 
forget the fact that Jane Smith is female - the undeleted redundant data will still imply it. 
A degenerate case of this is when the same datum tl f o mented twtc< - dtteting one of the 
instances of it obviously has no cf fm on the other. So, what ii i>e«i«d is some manner of 
wiling whether a deleted datum is $tiH implied bf thed«a-ba«e. THr U simply ordinary 
redundancy checkings Temporarily *dd thenar c^t^ and see if there 

is an inconstsoney. ff *het%n then the d«a-l»jewR«oat thrdetaed datum jtiH implies it. 
Having detected trfft anomaly , one irin the same< titthftlon that occurs when an 
inconsistency is deteaM ttuWng th* add (or person) 

must be called to drtermmehew toresdwrWfrgi J 
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5.3 Issues relating to CJEV primitives 
W.1 Logical Consistency 

The discussion of "consistency checking" througbouuhis document has been 
based on the supposition that whenever a representation's inference procedure signals an 
inconsistency, then, there is in fact inconsistent daft in the : .dau4pe (such as having both 
"Jane Smith is in the class FEMALES" and "the SEX-OF Jane Smith is 'male'"). However, 
it might be the case that the inference procedure occasionally signals inconsistencies when 
there are in fact none. If the inference procedure an not b* relied on* then the task of 
consistency checking is made that much more difficult --.^ such cases, it is not clear 
whether a signaled ^consistency is really due to inconsistent da*a or is just an artifact of 
the inference procedure. Thusjt is useful to be able to show tbavan inference procedure is 
logically consistent" ~ that it never sigiuU spurious inconustencief 

For represenfttions with complex ..ad. I hoc mference procedures, it is very difficult 
(if not impossible) to show that they are logically consistent. For CE. it is easy to 
demonstrate logical consistency because the behavior of the inference procedure is tightly 
coupled to the meanings of the CE primitives. Consider: All the inference procedure does 
is put labels on classes. These labels represent assertions (by the inference procedure) about 
which extensional object are (and are not) in the various classes: Now, a spurious 
inconsistency could be caused only by putting a "wrong" label Oh a class (e.g. putting a *x 
label on a class C when the dafa-base does not anywhere imply that V is in 'C'). However, 
section 2 shows that the inference procedure only propagates, the "rigM" labels: For each 
primitive in section 2, its label-propagating behavior is directly derived from its meaning. 
Thus there is no place where a spurious inconsistency can be introduced. 
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Note that a demonstration of logical coflitsferfcy require* three things: A logical 
semantics which describes what the representation means; a procedural semantics which 
describes how the inference procedure behaves; and some form of connection between the 
two to show that the behavior is in fact compatible with the meaning. Trim* it is impossible 
to demonstrate the logical consistency of a representation which does not have both a 
logical and a procedural semantics. 

5.3.2 Logical Completeness 

A representation can be said to be toglcally complete iff every possible 
inconsistency in the data can be found by the inference procedure (given enough time). It 
turns out that CE is not complete ~ a simple example is shown in figure 5-1. Here, it is clear 
that 4 A' and 'C contain the same objects (since b*th *A* and -XSf must contain exactly the 
same objects as I"). Hdwever, It is not possible to derive ah inconsistency suiting with the 
labeling that some object V is in 'A' but is not in t? - none of the propagation rules can 
be applied. 

Now, all knowti complete inference procedures for stiff idenrty Hen representations 
(e.g. those containing at least the Boolean connectives) end ujj> tatitijg tune proportfohal to 
an exponential function of the size of the data-base being used. Indeed, tCarp [1072] 
presents mathematical evidence to the effect that ajy. compteemiference procedure for any 
such representation must take exponential time. Thus for a moderately Targe data-base, 
having a complete inference procedure 'it of absolutely no benefit unless one is prepared to 
wait a very very long time while it rtins. 
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5.3.3 Practical Completeness 

Since logical completeness is so impractical, the best that can be hoped for is that 
a data representation be reasonably complete and efficient with respect tp the Kinds of 
structures that are most commonly encountered in the data-base. Lacking any large-scale 
empirical evidence as to how CE and other represemaUons perform in practice, the issue of 
practical completeness can not be resolved; thus no. ,n^t^,is ;n made of it in section 6. 
However, it is reasonable to say that CE processes certain "simple" constructs (for humans) 
such as "all A are B" in a computationally simple manner, and that the degree of 
computational complexity involved in processing a C£ expresiion correlates reasonably well 
with the expression's intuitive complexity, Whether or not this is of any importance 
remains to be seen. 

W.4 "General" vs. "Specific" Information 

A s^nificant feature of CE is that the same set of primitives is used to represent 
both "general" and "specific" information. An example of general information is: "Every 
person has a unique sex, which is either female or male* (see figure 3-lQa). An example of 
specific information is: "Jane Smith's sex is female" tfigure 2-4a). Npw, since the CE data 

representation and inference procedure make no built-in distinctions between "ge" eral " * nd 

"specific," the single inference procedure c*n , detect aH of the following kinds of 

inconsistencies: 

(1) Specific vs. General (or vice versa): Some piec^ of specif k; data is inconsistent with the 

general information. For example, the specific information tluit "jane Smith's sex is Mary 

Smith* is inconsistent with the general inf wmatiop that "every person's sex isekher male or 
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female." (Presumably the general information contains a taxonomy such as figure 3-1 
which has SEXES be mutually exclusive with PHtSIdAt^1iJi6T*S. Thus it is known 
that the object *Mkry-Smith* ts not one of the objects *ma£* and female*.) 

(2) Specific vs. specific: Twb pieces of sj^ifk data ax* mutually inconsistent. For 
example, *jane Smith's sex is female* and "J*ne Smith's sex is male." Each is consistent 
wijh the general information, but they are inconsistent with each other. 

(3) General vs. general: Two aspects of *ihe general informatiw are mutually inconsistent. 
For example, Hie general information might contain "an draft evaders are criminals" albng 
with "some draft evaders aii'fteioes* and *no heroes are artmiriaU" Such an inconsistency 
could arise '(for instance) if the general infornfttion came from different sources. 

It is significant that the same procedure which detects incoritiste^tcies involving 
specific information can also detect inconsistencies in the general information. For a given 
data consistency-checking application, this feature ma*es it sigfWflcartdy eaiietHbo "debug" 
the general Wf ©rmatidft whie* Wta be used toch«k me ini30mtt»g specific mfbrmation. 

5,%b Incomplete Information - 7 

In addition twr^hf a^ 
for general and sbectffc information facilitate the rwrgsentmr tf incomplete information. 
For example, one mifht hav* the^ mforniition that "ail drift WMdi^ii* aiminlte*' without 
having an exhaustive Mst of all the draft evaders. In some representations, the ohty way to 
state that "aH draft ev«ch^ a^iwlmlhal*" 1i to take itefa*']$&W&*ti *>tiHifo-M to 
state individually for each dnt that he I* a^fWrmil AhO, th* bnlf way to ariiwer'ttie query 
"Afe all draft evaderr criminals?" fc'#e^^ In &e data- 
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base and check if he is a criminal. It so happens that of all the representations listed in 
section 6, the o nly ones that can both make statements and answer queries about incomplete 
information are those which use the same primitives for general and specif ic. 

53.6 Quantification -- Explicit, Implicit, and Sloppy 

For those representations which dp represent both general and specific 
information in a similar manner, some means ii needed for distinguishing the tw 0f For 
example, "(HAS PERSON SEX)" might mean (in some Hyj^he^al representation) that 
some person has a sex, or that every person has a sex, or that all person; haye the same sex. 
etc, etc There seem to be three different techniques for handling the distinction between 
"general" and "specific" 

The first is to explicitly differentiate the two by associating different quantifiers 
with each (or by associating a quantifier with one and leaving the other as unmarked). In 
CE, explicit "general" quantification of objects is. provided, via the typical-member 
constraint (with all unquantified objects being "specific")., In contrast »4his, mathematical 
logic provides explicit quantifiers for both "general" and "specific" (i.e. "V" and "3". 
respectively). 

The second technique is to use primitives which involve implicit quantification. 
In CE, most primitives are defined in such a manner that they can be apjpUed to general 
classes as well as specific objects. In some sense, CE only deals with general classes <* a 
specific individual object is represented as a clasi which happens to be constrained to 
contain exactly one object. To see the quanUHcaUon implicit in the primitives, consider 
figure 2-8a, which states that PARENTS-OF and XHILDREN-0F are inverse? of each 



78 

other using one primitive constraint (and no explicit quantification). In mathematical logic 
(which ntilces all quantification explicit), the same tiling wouW be written as: 
VxVy[PARENTS-OF(x,y)«CHlLDREN-OF(y,x)I *' 

The third technique for handling the distinction between "general" and "specific" 
is to ignore it. This can lead to ambiguities such as the above "(HAS PERSON SEX)." 
This technique will be ct1led*sl0p>y quantification* Note that only those representations 
which do not have a logical semantics fait prey to steppy quantification— having a 
semantics prevents one front ambiguously using *uch words as "has." Woods tl9753 and 
Hayes fl9?4j point out several kinds of sloppy quantification. Of course, those 
representation! which db not allow both general and specific inf ormation have no need for 
any sort of quantification in the first place 

5.3.7 Worlds and States 

One major difference between CE and other representations lies in CE's use of 
the "world object" and "world class" constructs. A wbrla-objaet represents a (partially- 
specified) state of the universe, and a world-class represent! a collection of these. A 
significant feature of CE is that it treats worlds as entities which can be manipulated in the 
same manner as simpler objects. That is, world-objects may be quantified; may participate 
in binary relationships, and may in general be used in all the ways that other objects can. 
Thus it is possible to reason about worlds in addition to reasoning "within" them. 
Appendix C shows how the uniformity of this approach makes It relatively easy to reason 
using "knowledge about knowledge" (such as "Billy knows who Jane Smith's real father is, 
and she doesn't know that he knows.") This may someday have applications for intelligence 
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data-bases. 

M Iuues reUtinf to Q£'> non-primitive expreMtons 
5.4.1 Representational Completeness 

The sections on "logical completeness" and "practical completeness" discuss the 
completeness of the inference procedure (in terms of how thorough it is in finding 
inconsistencies). This section discusses a different sort of cpmplet^ss: ^ repre$enuti9n is 

"represenutip^ally , complete" for » . j^t5r*w . fPJPfiPI^MtiQn-. iC -ji^l t^aC.^J^p- jf^lffl. fWW?. f< 5" the 
application can be encoded aj strucujrei in the reprejervtation. , ;j Tb.|(k.. fike ;? prfiCtical 
completeness, is difficult to judge in the absence of a^ignifkant amount of empirical 
evidence, and in any case it is relative to the particular application. One purpose of section 
3 is to show that various useful macro-structures can indeed be built out of thf CE 
primitives. To recapitulate, the structures nxnttc^ in section 1|^ j^onomies, Boolean 
connectives, distinct £b jects,, cmri^tiye relation*. N*ar,y r fttiW&bi*, &*h nfa&WpK- *° v,erse 
relations, hierarchical contextt, naive probability t aad templates. It would Uke loo much 
space Jn section 6 to cpjnment or hqw adeojiateh>.each ol-itb^f^ioned. t represep|ations 
handles all o£ '.$*£ construe^ - only a few will be jnentioned for^ach. ,?„ 

"■■-'■ '*' '*' "■•■■'■■■■ j ■■%'- ■■■' -. 

M.2 Procedural Attachment :< , ,-,^ , • - » ^ 

The coii^r^ f r^m^secUon 3 **&&* a|e, 4isttd.ab^e f |*)§ oo%whieh. £E handles 
fairly well. In addition,, the^e, are other constructs wJ||cb b CE ; doeij;|pt W^»nt»y s h»ndlf 4>ut 
which are important in some of the other representaUons discyssfd in section 6. One of 
these is "procedural attachment": Thit allowiexecuuble procedures to be attached to 



80 

various pieces of data in such a manner that accessing the data causes the appropriate 
procedures to be invoked. 

For example, some systems handle additions to the data-base by running 
procedures which are appropriate to the specific datum being added, and similarly for 
deletions and updates. This will be called 'antecedent" processing, in that having the data 
(to be added or Whatever) triggers the procedure. Another example is that procedures can 
be used to derive certain kinds of data during an inference -- the appropriate data is 
computed by some procedure (using perhaps otfier data In tfie data-base), instead of 
actually having the data be explicitly present Tliu wnt be cafced "consequent" processing, 
in that needing the data triggers the procedure. 

5.4.3 Events 

Another important representational construct wnfch €t does not currently handle 
is the ridttdn of ^events": Simply put, art event corresponds to some change in the world 
(Which might have to be reflected as a change to tl& daia-bas*). NbwVCE does have 
provisions for accomodating changes in the dMa-base (fee seiction S&3 on additions, 
deletions, and updates). However, CE currently has no explicit representation for the 
meaning of an event. For example, a representation ot the event "getting married" should 
presumably say something about what must be true before the event can take place (e.g. in 
the USA the Wings%ettmtmarrtid mtfst be of dlf fefint W&mW r^i?ii|eable age, and 
not already be married), fh^ddttc*, ^ what changes 

as a consequence Of the event - for *gettihg married** it is presumably necessary to change 
the beings' marital statu! and to Indicate that they are no* spouses. 
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One way of representing all this ij >£o represent an eyeot as the difference 
between a "before" state and an "after" state. So .the J^rersj^flf !g«tt»$g married" would 
specify the above preconditions (different sexes, etc.), and, the after-state would specify the 
postconditions (marital status is 'mapied', etc*) .l^JhJs be called the "static" approach for 
representing events, in that the event (which. consists of two ^^ s^eianjd a |ransition between 
them) is specified in terms of the two static states, and the nature of the transition is 
derived from this. 

Another approach will be ailed the "dynamic" approach: In this one, the before- 
state and the transition are specified, and th^ after-ittt* Just follows as a consequence of 
"doing" the transition to the before-state. The sttndard way of doing this is to have the 
transition be some procedure which is executed to transform the before-sute into the after 
state. Since the static approach can a lto be said to specif v a pjticedure (implicitly* in terms 
its effects on the before-state), the defining character^, of the dynamic approach will be 
considered to be that the procedure which spectf^thf^^wsilioo j* » black bo$; The 
structure of such a procedure is unimportant since ..we are only interested in the effects it 
has |n terms of transforming the before-state. 

This is hardly the place to enter a discussion $f the philosophical nature of 
events and the "best" way to represent them. Howey% ^ 4s reasonable to include some 
discussion of the technical advantages and disadvantages of, the static and dynamic 
approaches. A major disadvantage of the static approach has been termed the "frame 
problem" [McCarthy & Hayes 1969]: It is not sufficient to just specify the differences 
between the before and after states -it is also necessary to somehow specif y that nothing 
else changes (unless perhaps it is a necessary consequence of the specif ied changes).. For 



82 

example, getting married presumably does not change a person's sex (or parents, or blood 
group - the list of iwhat does hot ichange is clearly too huge to explicitly enumera^l. 

The dynamic approach does not have this difftc^lty since the transition 
procedure presumably knows exactly what ajp«t$ of the before-state to change, and 
anything it does not touch is ipso facto unchanged. However, the ofcck-bbx nature of the 
procedure makes it much more difficult to reason about events (as opposed to just 
performing them). For example, using the dynamic approach it is impossible to decide if a 
given after-state could have fesulted from a given event - it is not possible to run the 
event's black-box procedure backwards* in an attempt to derive a°oefore-staie which could 
have produced the given affer-state. 

Given that the two approaches are good for two [different things, an obvious 
solution is to have both. The problem with this is that it is not generally possible 
(currently) to show that a given transition procedure coffettty implements a given static 
description. That is, it is oolite 'possible that the dynamic ciekHption arid the static 
description of purportedly the iame event* are hot in fact e^ul^erlrfo insure this 
equivalence, one either needs a powerful procedure-analysis technique ~'$ seelff the dynamic 
description does indeed satisfy the static one) or an equa% pe^er^l £ pr^edure-synthesis 
technique (to derive the dynamo description'given the static one), ^toth of these are quite 
beyond the current state of the art. 

5.4.4 Arithmetic 

One concept which the current formulation Of Ct has a f reat deal of trouble 
with is that of "number* It is possible (but quite unwieMy)Wexpreis numbers m'ferms of 
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cardinalities of classes, such as is done in certain axiomatic fon^j^^^ arithmetic. Jt is 
aJ*0|Wb»e to fo^^a^h^ in Peand's 

axiorof^^her, of t^mim^jmmf^m ^m^'^>4mmp^M?ii^^ uwafurai 

s f^ mp^h>jma^and f ^^ ;h,,, 

In addition ta the m fmmi" mm& mMWmt&t&WMtbe "fMwedural" 
approach. Many wmm^ifaidimMt lk mpk fomlm fi&Mkwk&mtow fm the 

arithme^ilifcmio^ «»ese 

represem#iaiq>/ind-*dtf^^ ftttt of 

course M*h *jr*«iu do not h*M to - tte Ulif ^WiMMMhm' thff OfttH*** »F«*h*» UliDt 
arithmetic. ■ v.--T^ ^.f^o^>; :, %:;t^% :■,,-.;.; ^-^ ,:w-. : --. 
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6 Some Representations 

The section discusses several representations in terms of the issues presented in 
section 5. No attempt i* made to explain rtw cHfftrent representations in great detail - 
anyone desiring such details should consuU ttre Wfeliography. Ttre representations 
considered here are chosen from those concerned with data-bwet (&BTG, Codd's relational 
model), mathematical logk (fim^i^er predicate calculoj), «gniHve sirmilation (Quillian'i 
semantic memory), and artificial intett^errce (PbrmeHike hmguages, semamic netwt>rks). 
This is a reasonably representative simple, iwt it ignores some of the newer ideas, especially 
those which currently l*ck sufficiently concrete documentation (e.g. MERLIN [Moore & 
Newelll974], and "frames" [Winograd 1975]). 

6.1 An Aside: "Assertions" vs. "Networks" 

Before proceeding with individual discussions of each of the above 
representations, it is instructive to group them into two broad classes; those which represent 
data in terms of "assertions" and those which use "networks." Syntactically, the difference 
between the two is obvious: Network representations (such as CI) encode their data in 
some kind of graphical network, white assertional representations prefer a linear notation. 
For example, figure 2-4a is a network representation for "the sex of Jane Smith is female." 
A corresponding assertional representation might be "(SEX-OF Jane-Smith female)". In 
general, tokens which, appear in assertional notations correspond to points (often called 
"nodes") in network ones. In addition, expressions nested within assertions can correspond 
to network points. For example, an alternative assertional representation for figure 2-4a is 
"female - (SEX-OF Jane-Smith)": Here, SEX-OF is a function and the result of the 
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function is represented by the whol« expreuion "(SEX-OF Ianc-Smrth)". 

Given this kind of rather direct syntactic correspondence between assertional and 
network notations, it is perhaps tempting to say that the only '^faeiw&jtmmMiam^th* 
syntax used, and that there U no reason other than periopal t»^« for preferring ona to the 
other. Indeed, when "network" information is entered Into a computer the network is 
usually first encoded into some linear nottdon which the compi^r can easily read. For 
example, various parts of CE have been implemented in LISP, which require j that 
everything be encoded as parenthesized, expressions. Then agau^a common notation for 
LISP itself involves drawing the parenthesized u expressions .ai^jietw^jrks!^^^*:^^"^' 6 " 1 - 
Thus it is clear that any network can be recoded as a set of assertions, and yice-yersa. 

However, there & more than a syntactic difference between networks and 
assertions when it comes to processing them. Network notation* aim visual connectivity to 
emphasize the log! connections between things, and thus are good for representations such 
as CE which operate on the basis of tracing through such c^nec|ioiM. Jin the case of CE, 
the local connections provide the paths along which lab«U propagate, and there ii no other 
kind of processing). Assertional notations, on the othfr hand, emphajii* the syntactic 
patterns of the expressions - they are usually processed yta some kind of pattern matching 
which compares two whole expressions at one tinje (instead of h»W«g to do it in terms of 
local connections). Of course, at some level the pattern matcher must uj< "kxal connections" 
(eg. the fact that two tokens are equal), but the usej, is not coocerned with this level of 
detail. 

Thus in the discussion which follows, jyjtemj will be called "network-based 1 ' if 
they process information in terms of local connection*, aj»d "asj^erUo^-basjd." if thav use the 
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rather more "global* connections provided by pattern-matching, 

6.2 Assertion-based systems 

6.2.1 Cbdd's Relational Data Model [CoddKfrOJ 

The primitive construct in Codd's representation is the flat N-tuple. and a 
relation is a class of such N-tUples (just as in CE a binary relation is extenstonally a class of 
2-tuples), The "slots- in each tuple contain atomic values (such as character strings or 
numbers) - they do not point to other tuples. Tuples are accessed via pattern matching - 
the standard accessing operation is to create a new relation consisting of all tuples in an 
existing relation (or cross-product of relations) which match a given pattern. The user 
interface to a relational data-base consists of a high-level query language, which gets 
compiled (or interpreted) into a series of pattern-match requests. 

The logical semantics for this system is the "relational algebra," which describes 
how relations may be rneanihgfully subsetted, projected, etc The procedural semantics is 
embodied in the pattern-matcher which implements these operations; Thus there is a 
reasonably close coupling between the logical and procedural semantics: 

As for consistency checking, this an area of current research. Much of this 
research is devoted to developing additional representations which can be used alongside 
the tuples. One reason that some other representation is needed is that the tuples 
themselves deal only with "value" objects such a numbers and strings - there is no direct 
way to refer to real-word objects (such as persons). 

When performing additions, deletions, or updates, it is necessary to do special 
processing to insure that the assumptions of the relational algebra are not violated. For 



^ -^Bi^v. ^ vw%.* " -^j*«e«*ifS« v-^?'- v&s-a ;^r T **» ■ 



87 



example, during additions it is necessary to check tha||bi?|u^K^>f *4dfd doe* not 
duplicate one that is alrea,d| there, J)uringde^^ delete those 

tuples which "depend on* the one being deleted^ (The notion qf "depend on" actually turns 
out to be quite complex,an(l is a topic 0/ CMr^n^ppp|rc| ijnte|$s$4 .., t| . . . .. 

It is easy to show that Qodd's schejne^Joe^jty,^ is 

consistent because the procedural semantics (ai jmpJememed in the pattern matcher) is a 
direct reflection of the logical semantics (the relational ajgeora), ^^course,^ may, : ** yer y 
difficult to show that a given implementation of a pattern matcher co r^y e fj y implements the 
relational algebra (especially when co^nplex pp$n|^^ idea is 

simple. As for compteteneu, since the relation|y|at^^ of 

a finite stjt of rela^ons^ea^ b^ing a finite ^ejajs^ ^j^t}^^u^k0, f ^ exhaustive 
enumeration procedure will be a wmptete one. , !n ,, ^ 3 • 

Now, the major limitation of Codd's^s^jBfnje u that it ba» absolutely no facilities 
for expressing general information. Since the. ce*t of tb* U*ue,s ; tij*qMMed in Mictwn 5 
depend in some way on tteusj of general ^foHna^Qn,UUpwiUe»s to discuss them in the 
con ^* xt <* this re^reae«|s>$km. ^^rf^t^^^pt^l^g |©j|sjs|ency ^hecjyng f m*i«i©ned 
above, another reason for ijeed^ a J^ra^ 

constraints is that they usually ; : lpvpJy^j|-^^o|ni||^ F|f ^anipte. "a^pjrsons have 
a unique sex, one of •male' or female" is one .s^^geiieml^coqstaint,, In summary the 
ability to handtejreneraj information is the jgjjgj I unction*! dtf fe^e^be^e^CE and 
Codd's scheme. 
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6.2.2 Planner-like Languages 

The Planner-like languages are the result of one approach for adding general 
information to a Codd-llke data-base. These languages were developed for artificial 
intelligence applications, and indudttlanner [Hewitt tffrS], fcohniverlMcDermott & 
Sussman 197S1 COL tPople 19721 and <£A4 Qtuttfibn et al B&21 For purposes of this brief 
discussion, no distinction will be made among them <eveh though significant differences do 
exist) -- the discussion is in firms tit the general approach, not in terms of some particular 
incarnation of this approach. 

The>e are two components of these representations. The first is an assertional 
data-base which is essentially like Codd's. The differences are minor: In Codd's scheme, 
the tuples are "In" the appropriate relation, While in assertional data-bases the appropriate 
relation is "in" each tuple (by having the first slot in the tuple be the relation's name). Also, 
assertional datarbas* tuples may be i¥e^ted, such as 
"(COLOR-OF BLOCK1 ®ARK jfi-D))". 

The second component of a Planner-tike representatton Wandles the general 
information. This if dbiie using procedural attachment as discussed m section 5.4.2. The 
procedures are attached to pattens; iuch as ^OLOR!-0F & 3^ When such a pattern 
is successfully matched against an assertion in the data-base, fhe pattern's variables (here, X 
and Y) get bound to the appropriate pieces of the assertion (e.g. X - BLOCK1 and 
Y - (DARK RED)). This binding process is the way a procedure nfteives its arguments - 
the procedure has access to the bindings of its pattern's variables. 

Both "antecedent" and "consequent" processing are done using attached 
procedures. For antecedent processing, there is one set of procedures for additions, and a 
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separate set for deletions. When a datum is about to be added to the assertional data-base, 
all "addition" procedures attached to pattern! which majch the [ da^m are executed. These 
procedures may in turn access the data-base, possibly causing ojher prccedures to be run. 
Similarly, appropriate "deletion" procedures-fare re^ecyied when a. datum is deleted, for 
consequent processing, there is another set of procedures for generating, assertions which 
match a given pattern. For example* a consequent procedure ^t|ached to the pattern 
"(PRIME ?N)" might generate the prime numj*rs,(il 9r ; . ! f^H^,Jb^|.;li^i(Mil}l)e to store 
them all directly as assertions of the form (PRIME 2), (PRIME 3), (PRIME 5), etc.) In more 
complex cases, the generating procedures can themselves access the data-base, possibly 
invoking other procedures. 

Now, since all processing within a system based on Planner-liKe languages is 
controlled by the attached procedures, the "procedural semantics" (i.e. behavior) of the 
system is determined by the user who codes these procedures. Thus little can be said about 
a Planner-like system's behavior "in general," because little an be said "in general" about 
the behavior of any programming language. As a consequence of this, .there \y no built-in 
logical semantics for the meanings of the assertions. For example, the assertion 
"(NOT (COLOR-OF BLOCK1 GREEN))" might mean that btockl is not green, if the 
relevant procedures have been coded to treat "NOT", aocord^j|,,» its customary meaning. 
Thus it can be very difficult to determine the global meaning of a given assertion, since it 
depends on the whole structure of procedures installed in the |ys|em (which may be very 
complex). 

Since there is no general logical semantics, there can be np general way of doing 
consistency checking. Also, notions of "logical consistency" and "h?gical completeness" are 



90 



inapplicable without a logical semantics. Consistency checking can be implemented for a 
particular application by having the "addition* procedures do whatever checking is 
necessary before a datum is added, but this requires that afl the information about what to 
check be coded directly in the procedures: Thus when new checks are needed it is necessary 
to change all the relevant procedures, which an be very difficult Also, it is impossible to 
do "genera' v *- general* consistency checking, since the general information is implicit in the 
structure of the procedure* and is not directly minipulable. 

Some Planner-like languages (eg. Micrc-Planner feussman eta[ 1970]) handle 
universal quantification by explicitly iterating through the set of relevant pattern-variable 
bindings. For example, the notion of "every dark red object" is represented in a procedure 
as a loop which iterates through all of the bindings of X for assertions which match 
"(COLOR-OF ?X (DARK RED))". This of course means that the class being quantified 
over must be reasonably small - "every person" would take too long, and "every prime 
number" would take infinite time. As a concrete example, the query **Are all draft evaders 
criminals?" is answered by enumerating all of the known draft evaders and then checking 
each one for criminality. Not only will this take quite a while if there are many draft 
evaders, but it requires complete information concerning exactly who aft the draft evaders 
are. Robert Moore [1975] is currently researching the problem of handling incomplete 
information within a Planner-like system. 

The languages <QA4 and Conniver do provide a mechanism for handling 
multiple worlds. Each world-class (called a "context") is implemented as a list of iayers." 
Each layer describes the differences between itself and the context represented by the 
following layers in the list. This implementation makes it easy to^riate a hierarchy of 
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contexts without unnecessary copying - only those assertions wjhich are different need be 
recorded. Using these contexts and the appropriate procedures, it 4* easy to represent events 
dynamicaUy - an event procedure takes a befojfrcon^exMndje^s,^ which 

is the bffore^context with an additional layer representing the chanfes^iue to the everu. 

Howeyer, unlike CE wortd-das^^cwiK^^ 
is, it is not possible to reason ... abouf contexts. For example ills usually not possible to 
determine whether one context is a itrone^,yjrs^ while in Q£ this U 

easily done (by showing that the first worMidass is r a 5 sjul|clas* / o| the jecen^}, 

With respecj to the.topici listed uoder^reDre|f^ sectfon 5. 

the only one* which Planner-like languages bundle well pmf&ffe relationsrand exceptions 
(such as "AH birds cap fly, except a few wch as pengmnt apd ostrichet," which CZ can 
handle using probability). N-ary relations (t*. assertion!) are primitive* in the system. 
Exceptions can be handled using the context mechanism: CceaUng a new context from an 
old one by adding an additional layer implies that the new .one || exactly like the old one 
excep t where explicit differences are noted in the new layer- Not? that, the notion of 
•subcpntext" (U a coii^xt grown from another one . by jada^ng^ layer), is quite different 
from the CE notion of one world-class being a su bclass of another. In CE, the subclass 
may be different frc^ tbe superckw by being stronger (U. by >npwinj more"), but it must 
be consistent with the wpercUw (i.e. it may not "knpw different! On the other bano\ in 
QA4 and Conniver a subcontf x* m»y be arbitrarily .fp; fereni^fjoni |& s^percpntext. 

In summary, the major function^ 4K$fflPf%N3Wj££}$*f?. ^* nr l*r* ike 
languages are (I) that CE facilitates consistency checking, and (2) that Planner-like 
languages facilitate procedural attachment 
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$.2.3 First-order Logic and Resolution 

In the format usually used by humans, the primitives of first-order logic include 
variables, constants, Boolean connectives, N-ary functions frf. SUM-OF(x,y)X N-^ary 
predicates (e.g. GREATER^H&Nfr.y)), and explicit quantifiers (V arid 3) "Resolution" 
[Robinson 1965] is the machineoriented inference procedure commonly died with first-order 
logic It requires that expressions be converted to "Skotem conjunctive normal form," which 
basically involves transforming them to remove the constants, the Boolean connectives, and 
the explicit quantifiers. Given a set of expressions m this format, the- resolution procedure 
uses "unification" (a pattern-matcher) to combine two existing expressions and thus generate 
a new one. This new expression is then added to the set of expressions, and the unification 
cycle repeats. Usually, the cycle is repeated until an inconsistent expression is generated - 
as with CE, this implies that the original set of expressions was inconsistent. 

Thus resolution (like CE) is oriented towards consistency checking. This requires 
first-order logic to have a logical semantics, and requires resolution to have a corresponding 
procedural semantics - there are in fact formal arguments which demonstrate that both 
these conditions do hold. Furthermore, resolution is known to be both logically consistent 
and logically complete. 

Like Planner, first-order logic does handle general information. Unlike Planner, 
the general information is expressed in the same mariner as the specific information. 
Furthermore, first-order logic an handle incomplete information - it is hot necessary to 
have information about alt the members of a class in order » make inferences concerning 
that class. 

So far, first-order logic and CE seem quite similar - it is now time to look at the 
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differences. For one, first*order logic has certain formal difficulties with expressing the 
notion of equality (U. identical objects). These <Mtf Acuities lead to various attempts to 
extend resolution to handle object identity in a more natural manner. In CE, equality is 
simply a degenerate case of subclass, which is a primitive Another difference is that first- 
order logic can handle arithmetic (and indeed most of mathematics) - it uses the "formal" 
approach discussed in section 5.4.4. 

However, the major functional difference between first-order logic and CE lies in 
GE's use of worlds - first-order logic has no analogouf construct. This makes it very 
difficult (if not impossible) for first-order logic te handle hierarchical contexts, knowledge 
about knowledge, etc Some AI research has been done on the issue of adding worlds to 
first-order lc^ic, notably by McCarthy <e.g. McCarthy & Hayes [1969]) - this Is still a wide- 
open area. 

6.3 Network-based systems 
6AI DBTG and COBOL 

The local connections in a DBTG network [Codasyl 19713 a?e the access paths 
along which a COBOL program may trace in order tcj.ge* access to the various records in 
the data-base. Thus th* interaction channel (Le. the manner, in which the "local" data 
structures interact to make more "global" ones) consists, *of the particular COBOL 
procedures which access the network. 

As discussed above with respect to Planner-like languages, using arbitrary 
procedures as part of the interaction channel means that there can be no general logical 
semantics for the representation. That is, what a particular piece of data-structure "means" 
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is totally dependent on the detailed behavior of the procedures which access that data. As 
above, if a representation tack* a toxica! semaniiet then it h impossible to have a general 
consistency-checking procedure for iKsim* such a procedure obviouily needs to know what 
the data-structures mean in ord«r to ten if they are cornl mm). In DBTG, almost all 
consistency checking (done at the time of an addition, deletion, or update) must be explicitly 
coded into the particular programs which do the eddtttom, etc. Unlike Planner-like 
languages, DBTG has little provision for procedural attachment. This means that the 
COBOL programs lend to became rather monoHtWc m new features are added to a DBTG 
system - there is no way to modutafiy attach new procedures to the Relevant data (at 
opposed to combining them aH m one monolithic program) 

There are of course furthet difference betw e en DBTG and other lepresentations 
(such as Codd's), but these are irrelevant to comparing DBTG with CE. The major 
difference between DBTG and CE is that virtually all of the interesting information in a 
DBTG system ("general" information, information about what the data-structures mean. 
etc.) is buried deep within the particular COBOL procedure* Unread df being more 
directly accessible (for purposes of consistency checking, changes, etc) H is of course true 
that DBTG can do anything that GE (or any other dtta4»* stheme) can do. but only 
because COBOL is a Turing-universal p rogr ammi n g language - DBTG does very little 
towards making such universal power more tradable to use. > 

6.3.2 O.uiWan'j Semantic Memory 

One of the first network-based representations wai Quiflian's 0967] model for 
human associative memory. The "iemamfc memory" ooniitts of a *« of nodes representing 
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"concept*" (such u "dog", "mm". "•**", «£•) connected by linki representing "associations" 
(e,g, there might be link* «wne|tiog "e»t" to both "dpf " and "meat," presumably indicating 
that dogs eat meat). The memory is accessed by taking tswo concept* and finding the 
shortest path of associations connecting them. Thus given "dog" and "meat," the shortest 
path might be the one through "eat" 

This a very much m the spirit of piychpiegicai word^awociation tests, and is not 
at all meant to be a model of mprrstrocturad -JJJpjtaiJf tiiinlMigr lor v jpajapUvtiie above 
example could just as well mean -^eWfa.. mkA&i&fo:W*siwm meanitif mother 
than that of pure association. Now, be^,# ptjchoiog4©»f «^od*^£b#ianV system finds 
the shortest path in a psychologically plausible manner. The system propagates imrkers 
along the links breadth first (in parallel), starting at the two given eoneepts ("dog" and 
"meat"). Thus the place where these two "wave fronts" internet i* guaranteed to lie along 
|he shortest path between the two given concepts, The osy«hc***at plausibility of this lies 
in the fact that it can be accomplished by neuron-Wee «e»U working in parallel. 

Oearly QuiWUn Vvcheme is too umtructured to bfnaefi»Hn,a data-base, It is 
included here because ft exemplifies label propagation and some ol*«r aspects of CE. For 
one. it does have a kind of logical semantics - the ,110(100 of "^shortest path" can be 
rigorously defined m terms of gi^ theory, The piweaural jenwuic* is straightforward 
(parallel marker propagation), and the connection between the logka4 and procedural 
semantics lies in showing $hat p^lW propagatm indeed wsults in finding the shortest 
path. 

Unlike CE, the critical aspect of Quillian's scheme is the timing of the 
propagations - if they are not done strictly breadth first then the first connecting path 
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found might not be the shortest. In CE, the order in whkh thing* are done it irrelevant ~ 
a different order may cauieifcbel cotttiento occur it a different pome In a CE net, but it 
i» only the occurrence which matten^not theptttfculw loation; 

6.3.3 Semantic Networks 

Having introduced the idea of an expWdt network of totfcs and nodes to encode 
meaning, QjiiHian and other* tried to apply soch netiwkr » Mrder problermQjillllan's 
0969] TLC system was ah attempt tt> do htttirafhirguagg c oi n p m eos tun ush% an extended 
version of hit semantic memory. T&C?* network consists of different kinds of Itnkj, with 
different rules for propagating markers along them. This prewnts torrftuiom $och at the 
above "meats eat dogs." ■ 

However, all existing semantic memory schemes %%. TOR US {Mytopouiot et al 
19751 OWL tMarMn f*H* and sTahtmanV HfTSJ fc*k a welWeftned k^kai ternanttcs. This 
leads to confusions such as the "tfoppy qusmifkatton" dUcosied in sectton 5. The 
"procedural semantic*" for a semantic network system lend* to.be embodied in some 
complicated procedure for tmversfhg the network, rather Met DBT<3. CgiflfianY original 
idea of well-defined paraHet marker propagation seems to Have been rejected a j being too 
tied up with a very naive view of neurophysiology, hating no beafthg On how things 
should be represented in a computer . From One viewpoint, C£ attempts to show that 
parallel propagation to inojrettme: cowputattonallr at wen « frucfctftoykaflv 
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7 Some History 

7.1 CE's Past 

As can be seen from section 6, research on CE has been influenced by work on 
mathematical logic and by work on semantic networks. From nuthtrruiiul logic comes the 
emphasis on having a well-defined semantics for all constructs. Also, a C£ "world" is quite 
similar to a logical "model," the major difference being that in Ipgic the models are not 
themselves manipulate objects in the representation, while in CE the world* are 
manipulate. From semantic network research comes Jhe idea of parallel marker 
propagation and the idea that everything should be specified in terms of local connections. 

In addition, the work which initially interested me in the idea of doing "semantic" 
computations using networks is Lamb's linguistic research into Stratificational Grammar 
[Lamb 1966, 19691 Much of the philosophical perspective which underlies CE is derived 
from Lamb, and so are some of the notational conventions (e.g. the symbol for CE's 
partition constraint is the same as Lamb's "ordered OR.") It is clear to me that without 
Lamb's influence the research leading to this document would never have occurred. 

7.2 CE's Future 

As mentioned in section 5, CE can not currently handle events, procedures, or 
arithmetic - one obvious possibility for future research is to extend CE so that it does 
handle these. Representing events is currently one of the hard problems in AI research - 
the clean semantics of CE's notions of "world" and "world class" may prove useful here. 

The appendices deal with several topics which are not as well worked-out as the 
body of this document -- fleshing out the details of these topics ("finding," 
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"implementations," and "knowledge about knowledge") is another task for future research. 
"Knowledge about knowledge" is especially promising because this topic concerns itself with 
the relationships among different world* (the "real w^/'the wolld'which represents some 
person's beliefs, etc), arid CE is a reputation in wltich ^^ about 

worlds. "Finding" is i topic which must be worked out in morVo^&^ta order for Ct to be 
practical for real database applications, and issues of "imjl^merif^tiin* are of course 
always important when Oftl is propping a rtew represehtaiibh. 1 beTfeV* that &£ is a system 
which can in fact be built Upon by myself and others, Ind thit it Is not just a pretty toy. 
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Appendices 
A "Finding" 

The body of this document is oriented towards consistency checking, which entails 
the ability to answer "yes/no" queries. This appendix briefly discusses three techniques for 
handling "find" queries, which are queries of the fora* "find aH objects V such that -.." 
Within the CE framework, the starting point for handling such a query is to construct 
(intensionally) the class which contains all the desired objects, and then to determine which 
objects are in fact in that class. For example, "Find all the children of jane Smith" would 
be answered by constructing the class T in figure A-J and then iinding all the objects 
which are constrained to be in T. Clearly, an object is in, %' if ^ajidonly if it is known (by 
the dau-base) to be a child of Jane Smith. 

The three techniques presented below are different ways of finding all the objects 
in a class such as T. Of course, if the query is "Find oneof ..." then the "Find all" 
procedure can be run until the first object is fouRd (afler which the procedure can be 
halted). 

A.1 "Find the..." using Object Identification 

If it is known that the "Z" class contains eKactlyi one object, then finding is quite 
simple. Figure 2-6b shows an inference for "Find the sex of Jane Smith." Here, the "Z" 
class is the object-class V (which is constrained to conH4n the single sex of Jane Smith). 
Starting an "-x" label from this class, the goal is to h**e V identified with some object in 
the data-base (in this case, 'female'). As described in section Z3, this identification occurs 
when a %x" label collides with an "-female" label .- this means that V and 'female' are the 
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same object. Thus the label propagation procedure has found the single object in the "Z" 
class. 

A.2 "Find an..." using Suction 

When the "2* dais contain* more than a jmgt* object, there is stiff a simple 
technique which can be used » do the finding. CemMer figure 5*1 and the query "Find all 
redwoods" The baite idea behind the "%wtfoh' 1 technique is to i&crt a generated *-g0096" 
label from the "Z" class (m thft case, KH)WCK>DS)/ and Then ^ 

the "-g0096" propagates. In 34, it is clear that the "^O0*B' wfft propagate down ihe 
taxonomy using rute(p2) until it re*th« alUhe (A Jem at the bottom (for example, the 
objects which are the known Sequoia Natkwttt Eark T e dwuodf ) - these objects are not 
shown in the Figure. 

It remains to be demonstrated that all the objects reached by the "-jg0O96* are 
indeed in the "2" class. Suppose tuch ah object fcafl it *c*>/)- were «& in T. Then it 
would be consistent to label "Z" with "-obj". This "-obj" label could then propagate in the 
same manner as the "*g0096" did, and thus reach 'ohf. Since an object-class such as 'obj' 
can broadcast an "-obj" label, the "-obj" and the "^Obf would coHWeat \bf, indicating an 
inconsistency. Thus «te original auumption that 'obj* w»$ not in "Y* U f aUe, so in fact 
every object reached by the "-gGOW' ntustbelh tttf1t*c1a& 

This technique is called ^uttiorT because the "Z* class tends oat "-"labels in an 
effort to "puff objects into it" flawing Hione a suction Ihferetice- the user is left with the 
problem of determining which database objects have beeh reathed by the "-fOOW. Prom 
the user's point of view, the d»t*-bas« cwuisti of a black bor with some classes being 
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accessible as "terminals" - the terminals consist of all the classes which the user knows 
something about In particular, the object-classes jr* which the user is interested will be 
among the terminals. So, what the user needs to do is to look at ill the object-class 
terminals and see which ones have the "*gQQ86" on them. Of course for a large data-base 
the user will need some sort of automatic monitor to watch the terminals and signal the user 
when an interesting label (such as the "-gG096") arrives. It is not difficult to see how such a 
monitor can be constructed (either out of hardware, or mpmttot a*€E system simulator). 

AS "Find all ..." using Reflection 

The one problem with suction is that it i» very incomplete -- there are many 
simple cases in which the generated "*g009&" becomes blocked and can not propagate far 
enough to reach the relevant objects. Figure A-2 shows a simple case of this. The network 
above the dotted line states that Jane Smith is a child of Mary Smith, and that Billy >nes 
is a child of Jane Smith. Now, the query is "Find all grandchildren of Mary Smith." The 
network below the dotted line constructs the class T. to be the children of the children of 
Mary Smith, and the task is to find all such objects irv*Z*„ Glearly. 'Billy-JoneV is an object 
in T. To see this in terms of label propagations, just start an "-Billy-Jones" label from 
Billy-Jones. By applying rule(b2) twice (along with rule pi) and then rule(bl) twice, a 
"♦Billy-Jones" will propagate to V, indicating that Billy-Jones is indeed in *Z\ 

However, suction fails to propagate a "-" label from T to 'Billy-Jones'. Starting 
from a "-g0097" on T, there are no propagation rule* which can be applied- What is 
needed is to start from the other end - some means is. needed fo have 'Billy- Jones' start an 
"-Billy-Jones" from itself. As described in section 2.2, it is infeasible to have every object- 
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class in the data-base broadcast an "-" label, since this would swamp the system. Thus the 
goal of the *Veftectte»" technique i» *mri*^*WM&WM&ftim-Wifai)tR : ^~Titoh 
the relevant ob jecta <suct» at *fti Hy- Jones') ami prebaWy seme of the irrelevant ones too. All 
objects reached by this label v^n then refket OaA their *-" Wjeh, and then only the ones 
which are truly in T witt evemutHy have a V !s^ propagate to T. That is, the initial 
label sent out from If is meant to select a smalt number of objects (relative to the lite of 
the data-base), and then these selected objects are sestet for membership Hi T by having 
them broadcast their V labels. 

The second stage of this process is already weft defined ^ as described in section 
2.3, an objeet-clats broadcast* an ;■ V label whenever it is reached by any other label 
(including the one sent out from T). It remains to de s c r i be lh e f irttrtage - the iabel to be 
sent out from *ZV This must be a new type of Isbeli esfios 2 ^*, • V, and "-" label* i*n be 
easily blocked. Call «bb> new label dttT kbd; Ufrt&e the V. '-", and "-" labels, the *?" 
label hat no reel semantics - a "?" label on a diss simpif means that the claw is somehow 
"associated" with the "Z" da». 

©no possible propagation rule fir T is to say that when arty con stntint detects a 
"?" on any attached ctaM-poim, is should p rop agate a T to all of its other attached dass- 
points. This rule results in a QjiillUrn-Uke "wave from" of T fctbeh, propagating out from 
the initial "Z* to everf class which is connected (t« the gnph-theoreric sense) to T by some 
path of constraint » and points. Since every relevant object <te. ones in *Z") must be 
connected to "Z" by some such path, this propagation tehnioue ir guaranteed to have 7" 
reach every relevant object (whlclrwtfl then reflect back *ts "** label). Unfortunately, this 
particular propagation rule will cause theT" to teach everr point in the network, unless 
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the network is really two or more totally disjoint ones. That is, everything is likely to be 
connected (via some possibly long path) to everything else, so this propagation rule is not 
selective enough. 

The details concerning a more adequate set of propagation rules for "?" have not 
yet been worked out -- that is why this material appears here in an appendix instead in the 
body of the document. 
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B Implementations of CE 

This appendix describes three possible implementations of CE. The first uses 
unconventional cellular hardware to do the label propagations 4n parallel The second is a 
modification of the first which uses an mjtf'mkxapnam^ feasible than 

the first with current technology). iThe third implementation is one that actually exists - 
the label propagation*, are performed using an ordinary general-purpose computer. 

B.1 Using Cellular Parallel Hardware 

The basic idea is to have each constraint in the data-base be an active processor 
which continuously looks at its attached class-point for patterns of labels which match the 
constraint's propagation rules. When such a match is found, the processor propagates the 
appropriate labels to other class-points. Each class-point is a register which indicates what 
labels (if any) are currently on that point Thus each constraint is a processor which reads 
and writes the registers corresponding to the class-points attached to that constraint 

The main limitation of such a CE machine lies in the number of labels which 
might pile up on a single class-point There are basically two ways to approach this. The 
first approach is to endow each point with a fixed number (N) of slots, each slot containing 
a pointer to a label. Here, the number of labels on each point is limited to N, but the total 
number of labels in the whole network may be much greater (since each point may have up 
to N different labels). The disadvantage of this scheme is that each of the label slots will 
be several bits wide (the tog of the maximum number of labels allowed in the network at 
one time). If this number is M, then each point requires N times M bits, and the constraint 
processor needs to be able to copy and compare these M-bit labels. This requires either M 
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wire* cpnneaing each constraint ^tt.^Kh : or..|^ t! psilntv.oc .««ip|f...jmMlt|[p|fi^L scheme using 
fewer wires. Either of these alterna»ves »* ,}l$&r$fo)K£%iii i^j# important to minimize 
both the number of wires and the complexity ctf each processor. 

The second approach allows both for Cewer wires and simpler processors. It relies 
on limiting the number of different labels in the entire, network (not just on each point). 
Assuming that N labels are allowed, then each ooint need ,onl| have 2N, bits worth of 
storage Each label U represented by two bits, whose fpii;.states indicate "♦" "-", "-", and 
"none." With this scheme, copying a label involves cjjaprinj; only 2 bits ^the b»|$ being 
indicated either by a pointer of tog^N bits, or ..bj n^ltipjexjngk Thrpugb £ all pofntsnins 
the "point bus" which sends re set signals to aJlo^in^jWHf h*ndljej the signaling of 
inconsistencies. A point will signal [an inconsisj^y Jf,it is told to set a, label's state to "-" 
and iu current sate is V or "-".and vice versa. 

The partition constraints and the object con*trainU dp not n«ed to know what is 
"inside" a label (being interested only in the ♦(-/- s||tes^ but the o^cpjisjrairtfs.aa h» ve 
to be able to decompose a complex label into iu simpler ccim^wnerus. Hence the system 
must contain one N-stot "label memory" which g^ves tM s s^uc^|pr e^ch an 

object/world pair, or an ordered-pair/worJd triple, The "world" J»« of each Jabel is a 
log-N pointer to the label in the label memory which i|4hf,pr|g^l label's world-tag. To 
allow access to thU memory, there must bt a "label bj|"wbteh gr^UJthe cpnjent* of the 
label memory for inspection by the constraint processors. This an either be done on a 
request basis or in some synchronized manner (such as giving the contents of I thru N in 
order with the proper synch). 

An additional complexity is that is that t-m constraints, world constraints, and 
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binary relationship constraint* can generate new labels. This must be handled by the* label 
bus either on a request basis Tor by tavlirtg all unused trt>«rm*nory sfots be assignable 
during their presentation-time oh the bus. Ate, the label rnemory must contain information 
about object identifications arid about the wor^lree. • 

Finally, there must be a procedure for *gro*%" new wires when a new datum is 
added to the data-base, otherwise it would be necessary to change pieces of hardware every 
time new data is added. Consider the CE machine to consist of a tessellation i of constraint- 
processor cells and point-registers. Each processor cell is of a fixed type (corresponding to 
the particular kind of primitive constraint Which the ceH implements), and has fixed wires 
attached to 1 through 4 point register! whidt it 'own** (the number of points being 
determined by the type of the constraint). Now, a hew oMuimVad&cf to the data-base by 
adding some new constraints. A new constraint "it "added" by selecting r a* currently-unused 
processor cell of the correct ty>* and Imtirig" i^ wned po^tslo the^ appropriate other 
points in the network. Wheh two point are Tintidl" it ni^t^ they represent the same 
class and hence their registers must be kept in the same states t-liey must be "wired 
together." Now, figures 2-lg and 2-lh show such a *wire" - itfls a partition constraint with 
only one subclass. Thus the task of "growing a'wire 1 * between two point-registers translates 
to the task of activating enough of the currently 4 unused parlioh* cwstralto to form a 
chain between the points. The cfiMin will actively propagate aff the labels from «ch of the 
original point-registers to the other, f o grow the chain* a <plan-liKe "wave front" of 
special labels which propagate only through unused partition constraints can be started 
from each of the original point-registers. The place where the two wave fronts first 
intersect is then known to be part of the ihortest path connecting the original points. The 
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unused partition constraint which is at the place of intersection can iec«£v§je itself, and 
propagate a wave of "activate yourself" labels back along both wave fronts. Those 
partition constraint* in the original wave fronts which do go£ receive! "f^tjyate" labels are 
not part of the chain being grown, and thiu remain unused. 

Of course, many details stiH remain to be worked out before s a cdjular CE 
machine could he built, even if current LSI technology Wau^^ of the task. 

■■■'.;.■■■*. 

B.2 Using Micrqproceiiors ("active pages") 

Since the cellular machines proposed above are not ||koty to exist for a while yet, 
it would be convenient to be able to use current technology to imptement the CE 
parallelism. One way to do this is to segment the .network into local "pages", each with its 
own microprocessor for propagating labels within tbf .jsjgJfcvSbe nejwoik within a page 
would be implemented a* alinked-Ust structure^so the problem of growing wires in cellular 
hardware does not occur Each processor has access tOrthe "b)bjlbtts"af above, in addition 
to a common "mail bus" which is used to e* port (and import) labels which cross page 
boundaries. 

It would not be unreasonable with present technology to have a l-page chip which 
contains the processor and the linked-Ust memory for a single page. These could then be 
stacked up to make the data-base, with more chips being added to thr stack as the data- 
base grows. 
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B.3 Using a Digital Computer 

A GE system has been implemented to provide the data-base and the low-level 
inference capability for the "MACSYMA Ad*&or* [Genesereth iitm* MACSYMA is a 
very complex system for doing sympatic mtthematiis; am! 'ItnTe advisor is a proposed 
subsystem to aid users when they need ne^p. The user wilt interact with the advisor in 
more-or-less natural English. The advisor uses the English input, the history of the user's 
interactions with MACSYMA, and the advisor's own knowledge about MACSYMA in 
general and this user in particular in order to formulate toadvUi. The advisor consist s of 
an English parser, a high-level problem solver, and a low-ifvel 1 data-base and inference 
capability <for which CE is used). Th* data-base contams essentially aH df the' advisor's 
information about hf ACSYMA and about the User . ©er^reth estimates that the CE data- 
base wW conUin about 20000 constraint*, s 

The CE system irsW for the advisor is Implemented in USP without any 
multiprocessing. Parallelism it simulated by having a priority quette of propagations ttt be 
done. With a sequential system, it is very imperamt to have good heuristics "f bt deciding 
which propagation to do next - doing them purely breadth-first (as parallel hardware 
would) is quite wasteful Two of the heuristics used affef ••< 

(1) Propagate "♦" labels in preference to ^" ones? For example;* is much less expensive to 
propagate V labels upwards in a taxonomy (sect* as figure W) bshi£ rote (pi) than it Is to 
propagate "-" labeU downwards using rule (p2). Basically, this heuristic says that is usually 
more informative to know what something u as opposed to what it is not. 

(2) Propagate existing labels in preference to generating new ones. This is a useful 
heuristic since the implementation is limited in terms of the number of different labels that 
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can exist at one time. In addition, it prevents inferences from wandering off into long 
nestings of relations (caused by rule b2), such as "my father's brother's political party's 
candidate." 

There are other heuristics which will be described in Genesereth's report on the 
advisor. 
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C Knowledge abourKhowledge * 

The problems involved with repr«enttrtf "Vnowtedge about knowledge" arc 
interesting both technically and philosophically; they are also quite difficult. This 
appendix shows ttd#»me of these problems can Wttititf«Fti^ using the CE "world" 
construct, which allows explicit statements to be made about various worlds (both physical 
and metaphysical). This appendix is divided into five sections: The first two deal with 
"belief"; the third ar.d fourth deal with "knowledge" (te. "true" beliefs); and the fifth 
briefly discusses modal logic The example used throughout this appendix is the following: 
"Billy knows who Jane's real father is, and she doesn't know that he knows." 

CI Belief 

To introduce the idea of "belief," this section uses a simplified version of the 
above example - the full version is used later. The simplified version is: "Billy believes 
that Jane's real father is John, and Jane doesn't believe that Billy believes it." Figure C-la 
represents this using CE 

Region (a) of C-la states that Jane's father is the object 'jf* (named acronymically). 
Without having any other information about 'jf' (which region (a) does not), all this says is 
that Jane has a unique father. 

Region (b) states that W-JF-J is the class of all worlds in which Jane's father 
equals John. What it literally says is that W-JF-J is the class of all worlds in which object- 
class 'jf is a subclass of the object class 'John'. (As has been mentioned many times, one 
object-class is a subclass of another if and only if the two objects are the same.) 

Region (c) defines W-BILLY to be the class of all worlds which are consistent 
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with Billy's beliefs. Of course, the relation class 'BELIEVES' has no a priori meaning to 
CE (just as 'FATHER^OF' does not), but ttjifra|sum«4 s th#i ; the user always uses 
'BELIEVES* to mejyi the relation between m individual and allthe world-objects which 
are consistent with that individual's beliefs. The. nair^/'BXLI EVES," is of course arbitrary 
and is not part of the data-base in any caw ,.,■* the impomnr structural feature of 
'BELIEVES* is that all rf ferences to an individual's beliefs art made via this class. I 
belabor this point only to emphasize that oottujf ney Juu been intjodu'-.ed - 'BELIEVES* 
is just an ordinary binary relation. 

Region (d) states that Billy believes that J^ne>. £||tier .»*; John. That is,, every 
world in W-BiLLY (i.e. every world constant with Billyjf be^fs) is also a world in 
W-JF-J (i.e. is a world in which Jane's father equ*b John). A* in icction 2J>. the use of the 
subclass constraint means that W-BILLY is stroog^ r than W-IF-I - Billy believe* at least 
that Jtnf'i father is John, and he may btlieve other things. 

Region (e) adds the constraints that Jane does not believe that Billy believes that 
her father is John. As with W-BILLY, the class W-JANE contains all woj-lds which are 
consistent with Jane's beliefs. The ria» t iW-BBJF*J Js all world* in whkh. Billy believes 
Jane's father is John (i* ail worlds in which W-BiLLY i* «f»nger than W-JF-J). The 
partition constraint then means that there is no potM i* W-JANE which U also in 
W-BBJF-J (i,e.ihe two classes are mutually; exclusive}, That is, none t pf lane's possible 
world-views allows for the possibility thaj "BMh; believe* ,,.." ; More literally, in every one of 
Jane's worlds it is the case that W'BILLY « not a Mibt?taw of W-JF-J. That is. in all of 
Jane> worlds there is some world in W-BILLY ifMltef^^'fe^^FW-JJF-J.wBdly has at least 
one possible world in which Jane's father ii qo£ John. 
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C.2 "I believe ..." 

There is one remaining problem with figure C«la. Region (e) states that in all of 
Jane's worlds it is hot the case that W-BiLLY is a subetais of W-JF-J, while region (d) 
states that in all worlds it is the case that the W-BILL^ is such a subclass - since the 
subclass constraint in tegton (d) is not relativized, it ii*enatiyj*for att worlds. Thus it is 
necessary to relativize region (d). Well, W-tBJFij is already defined as being exactly 
those worlds in which the subclass constraint holds: thai ^ the subclass constraint in region 
(d) can be deleted, and something new should be connected to W^BBJF-J. the question is. 
who is the one who believes that "Billy believes Ji the answer is that the data-base 
believes it. Therefore a world-class is needed to represent the data-base's "point of view" - 
call it T. Then figure C-lb shows what should be added when region (d) is deleted ~ it 
states that I (the data-base)1>elieve ffca*B*tly believe!™ Behavibrally. the class T is used by 
putting a "♦inf" label on it as part of the initial labeling, this enables whatever is attached 
to T, such as the W-BBJF*J. 

Now in one sense everything in the data-base is quahf ied by "I (the data-base) 
believe such-and-such" - after all, th* data^biae i^etf ran beSjewed as being an entity 
with a point of view, much as Bttly and Jane are. The reason for needing an explicit 
representation for "I (the data-base)" U that it may be necessary to represent oth*r points of 
view which conflict with the data-base's. 1ft Hie above example, jane's point df view 
regarding Billy's beliefs is different from the database's, and thb conflict is what 
motivated the introduction of T in the first place By having im explicit T, the data-base 
can keep track of the difference between facts ft b«li«v«s to be true in M worlds (including 
Jane's, for example), and facts which it believe* to be true only in its own worlds. The next 
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section deals with how the data-base a^t^^f^ l ^Jm.!^_^)f^.ot,XhK»^.faMny worlds 
are "true" and the some (like Jane's) are not true, i0 , 

C.3 Knowledge, God, and Wjsdom 

The simplified version of the example, represented in figure Jf>l, deals only with 
"belief." However, the original example deals wi^"I^wled|e":rn i ot only does Billy 
believe that Jane's father ;.is J^n, he kj)gws it. J}ne solution Js^ to, $ay. that whatever the 
data-base believes is necessarijy "true" (at lea^ insofar aj, the 4l^-bf« U concerned). This 
is quite reasonable - after all, how could ^^-^^ff.fjgp|lMs^'^ylMng;j|f it were 
in continual doubt about the validity of what it believed? Of course, there may be cases in 
which it is desirable to represent the fact lha| th^ dail-^e^nsiders i$*elf to be an 
unreliable source of information regarding certain ( topks.~ ,?his,can be handled by the 
"probability" mechanism described in section 3JS.?. So, in the context of the above example, 
the fact that it is true that Jane's father UJoJ^aQ^ represented by adding the fragment 
shown in figure C-lc: Now, both John anqVthe d^tbase^lfcye tha| Jane's father is John, 
which makes John's belief "true" (inso/ar^as the data-base jjLjconceiped). 

The problem w|th this scheme is th*t "tnith"^ Refined, te be exactly that wh^ch 
the data-base believes. It is reasonable to say that the data-ba«Voelief> contain ggly. true 
statements, but it is unreasonable to say .^a; ^datarb^se's beliefs contain ajl. true 
statements - the data-base certainly d«es not have. complete information about everything 
that is true in the universe. If the d^jaHbasj i does no|t ha^vf^hiJ i cppJejDR, jnformatiop, who 
does? The solution is to create a world-class W.^^jy^^ji^^^.^BiMain all the 
worlds which are consistent with "reality." 
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Now, it is desirable to retain the above notion that everything the data-base 
believes is true, even though the data-base's beliefs dtf ft©* entortiptss all truths. This is 
represented in figure C-2a. As usual, the direction of the subclass arrow is from stronger to 
weaker: God's beliefs (i.e. reality) are consistent with the dab-base's beliefs, but God 
believes many other ihings in addition. Note that W-COB rrogh* beso strong as to be a 
single object Representing W-GOD as an object^lasswwW mean that God allows only 
one possible universe. Although the issue of wtaer the universe is ^dne" or "many* might 
be of philosophical interest to some, it appears to have no technical importance here - I 
have not yet found any cases for which it makes a difference whether or not VV-GOI) is an 
object-class. 

Using W-GOIX makes it possible to abstractly describe two different aspects of an 
entity's "wisdom." The first aspect is that everything the entity oeheves is in fact true - 
this is shown in figure C-2a. The second aspect is that the entity knows all there is to know 
(i.e. all truths) -- this is shown in C-2b. The combination of the two of course means that 
the entity knows exactly what God does. NoW, this notion Tbf ^absolute wisdom" is clearly 
not very useful - often someone is considered to be wise only with i r&pect to a given 
subject area. One possible soluTSon is to divide God's knowledge into several domains - 
figure C-2c divides kndwtedge into the domains of "accounting," "mathematics," and 
"other." Then, to say that Billy knows everything there is to know about accounting, the 
network in figure C-2d can be used. Of course, domains such as "accounting" are much too 
large for most purposes - they dn be further divided until an "appropriate site is reached 
(such as "Billy knows air there is to know about accounting for mergers using the pooling- 
of-interests technique"). ' 
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Of course, just using the name "W-ACCOUNTINQ" dcx* not mean that the 
data-base thereby knows what the dow« of acwwnling iv It may be necessary for some 
appUouwm to constrain W-ACCOUNTING appropriately - tM« u a topic for future 
research. 

C4 InterworW Objects 

The above discussion of W-COD does not seem to have much direct relevance to 
the example of Jan*'* father - everything work* cqrrecUy by J^ wing the cofutruption in 
figure C-lc (without Deeding to introduce V/rOQDl As, detailed above, ^tys construction 
involves the assumption that the data-bsue knowj everything that i* true, but this 
assumption causes no difficulty in the example because, M* fact the date-base doevknow all 
the relevant facts. However. figure C-i doe* not quite handle Jthe original example, which 
it "Billy knows who Jane'i father is...", not "Billy knpw* Jaoe't fa^ier is John." The 
problem is that Billy may know that Jane'j father i* John (or whoiver), but the data-base 
does not know it. That is, there is an entity (Billy) who knows more than the data-base, 
which means that the data-base can not be used as the arbiter of truth. This is why 
W-COD is needed for the example. 

A rephrasing of the relevant part of the example if %Uy beUaves 0at Jape's 
father is qc, and Jane's father u in facto." That^i* both Bitty f #nd God believe that Jane's 
father is identical to the object V, but the data-bat* dot* not know anything else about V. 
As a first attempt at representing this, consider, figure C^, which is meant to repUce the 
relevant parts of C-Ja, 

The problem with C-3a is that, the object-class V can be- a different extensional 
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object in different worlds, just as in figure 2*3a 'the-pmidem-of-the-US' can be different 
people at different times. In particular, V can be-different for Ced and for Silly, which 
goes against th? idea that God and Bitty should have the same V. The solution is to 
introduce a new primitive constraint, the "interworld object* constraint, the symbol for 
which is shown in figure C-3b. This constraint acts the same as the ordinary object 
constraint, except it always represents the same extensional object in all worlds. Its label- 
propagating behavior is the same as the normal object constraint's rule (ol) -- it broadcasts 
an "«obf label. The difference is that cotllJierw between sueJrn llibet and a "♦obJ2" label 
can occur regardless of whether or not the rw©^Mbe1r1»aw«fi# s*me wortcPtag. Thus 
objects in different worlds can become identified with each other. This befiivior 
implements the fact that the intewortd object (an intensional construction) represents the 
same extensional object in alt worlds. By making the bb^dHcliss^" an interwbria object, 
the example is completed: The data-base knows that Billy knows the identity of Jane's 
father, without the data-base itself knowing that identity. 

C.5 Modal Logic 

Modal logic deals with (among other things) the distinction between "necessary" 
truths and "contingent" truths. A necessary truth it one that is true from the definitions of 
the terms used - for example, it is necessarily true that all wows are birds, if we use 1*ow" 
and "bird" with their normal rneanings. However, it is only contingently true that alt crows 
are black - no logical laws would be violated if i pink «ro# appeared tomorrow. Modal 
logics are systems in which the distinction between "necessary" and "contingent" can be 
explicitly specified. Brt this sense, Clean bettsed s iraiinoda»%igi£ iftgur#€4a states that 
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all crows are necessarily birds - the subclass constraint holds in all p omblc work}*. Figure 
C-4b states that all crows are contingently black - the subclass constraint holds in this 
reality, but it might not hold in some other. That is, the data-base allows for the possibility 
that in some worlds it might hot be the case thaL»H crows are bjack, 

Another aspect of modal logic de»l» with notions such as "want," as in "John 
wants Jane to be with him," A rough tranila^ion of this js tba| John "desires" a world in 
which Jane is with him. This can be represented directly by introducing a binary relation 
DESIRES with the same form as BELIEVES - it relates an individual to a class of worlds. 
Working out the details of this is a topic for future research. 
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D Design Decisions 

As With any research project, certain mare-oHess arbitrary design decisions had 
to be made during the formulation of C£ in order that the Work might proceed, this 
appendix briefly discusses some of these decision* and some alternatives to them. The 
main reason for including this appendix is that -tor certain applications of CE, some of 
these alternative designs might be preferable to the ones described in the body of this 
document. 

D.1 Other Labels 

There are alternatives to the use of V, "-", and "-" labels. One such alternative 
is to eliminate the "-" label. The only interesting propagation rule which this change would 
eliminate is rule (b5), and for some applications this rule might not be necessary. Rules (b6) 
and (b7> were included only for completeness - they refer to relations which contain only a 
single ordered pair, and this notion has not yet proved to be useful. 

If "-" is eliminated, it is necessary to reformulate the process of object 
identification. With "-", identification occurs when an V collides with a V (or another 
"-">. Without "-", identifications occur when a V reaches an object-class. Thus the "♦" will 
have to come to the object, instead of the V and the object's "-" being able to meet "half 
way." This may reduce the number of object identifications - whether or not this is 
important depends on the particular application. 

Another alternative is to add the V label If this label is on a class, it means 
that the class is empty. Here, V is a notation for "all objects," so "V means that all objects 
are known to be not in the class. This label fits in well with the existing propagation rules 
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which involve ■"-" - after all, "-" means that the class is almost ; ^mgty. Fof instance, 
rules (p5) and (u5) can be augmented; to pu|*i" label* job, |hj classes cyrrefltly labeled "-" 
since they are necessa/ily empty. This add$«fl§l ^Jprfn^iqn may be of use in some 
applications, particularly wher? there ap; many empty#u*t*. »= 

A further alternative is to scrap, all of^the e^is^gjlabels in,, favor of- a different 
scheme. The existing labels all refer to objects - a different Khero* can be used in which 
the labels refer to cfaws. In one such Kh^me, there are three label*: 
VA" on a class 'C means that 'A* is a subclass of *C'; 
%A" on 'C means that X? is a subclass of 'A'; 
"•A" on 'C* means that 'A' and 'C'are mutually exclusive. 

In this scheme, V" corresponds roughly to V; V corce^^ a 

combination of >A" and %A" On the same point "C! corresponds .0 "-" (since each d§ss is 
known to be a subclass of the other)., The propagation rules for the primitive constraints 
can be modified to handle these labels appropriately. The problems with this scheme stem 
from the fact that it does not refer to objects. For one thing, it is unclear how the idea of 
"object" (e.g. single-member class) can be formulated at all - some other kind of label is 
probably needed. For another thing, it is not knowable whether the classes bejng referred 
to are empty or not - all the classes could be empty and the labels would still propagate. 
ThU actually might be an advantage for applications where it is required that all classes in 
the network be non-empty. 
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D.2 Other Objects 

Currently, GE has two kinds of objects which can appear in labels: simple objects 
such as "x", and ordered pairs such as "<x,y>* It may be useful to introduce other kinds of 
objects for certain applications. For example, consider integer arithmetic Special integer- 
objects "3", Mr, etc art be defined, and primitives such as # «jm 4 and "difference" can be 
defined which use them. For handling inequalities, object* can be defined to represent 
integer intervals, and these can be manipulated by constraints which express the various 
inequality relations. ■*■■■■ 

Another kind of object is the nested ordered pair, such as "<a,«b,o,d»". 
Nothing in this document has required them, yet nothing explicitly prohibits them either. 
It clearly complicates things to allow labels which are arbitrarily deep nestings of pairs, but 
this complexity might be worthwhile in some cases. For one thing, USP-like structures can 
be built by defining a primitive constraint tor CONS which can be used to put these pairs 
together (and take them apart). For another thing, quantification could be performed 
without using the t-m constraint. The basic idea is to introduce a primitive for relation 
composition, and let the bound (quantified) objects be explicitly carried along in nested 
tuples. This technique is equivalent to Skolemization in first-order logic It is not hard to 
work out the deiails of such a scheme for relation composition - such details are not given 
here primarily because the resulting expressions seem to be very unnatural and awkward to 
use. '" • : "'' "' ' : ' u ' 
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D.3 Other Primitive Constraints 

Since CE is highly modular, it is possible to introduce a new primitive without 
having to worry about how It will interact with all the existing primitives. Indeed, 
throughout this document new primitives have been repeatedly introduced or proposed 
(e.g. in the immediately preceding section). Since CE is built entirely 4»ut of label-objects 
and primitive constraints, any addition to CE will be either in terins cf new objects, new 
constraints, or both. 

One such possible addition involve* "procedural attachment" as described in 
section 5.4.2. Within the CE framework, this involves defining a new constraint which 
behaves normally - it looks for appropriate patterns of labels oj* its attached class-points, 
and propagates new labels when such patterns occur. However, inside this constraint might 
be an arbitrary procedure for accessing the outside world either to receive information or to 
produce effects. For example, the class 'PERSONS' might be tied via such a procedure to 
an external file which lists all the persons. A "*©bj/inf" label reaching ihis class causes the 
procedure to add 'obj' to the list of persons, and a "-obyinf" causes an inconsistency if in 
fact 'obj'U a person on the list. 

The major limitation on the power of such a procedure is that is must have a 
well-defined semantics in term* of label propagations. In the •PERSONS' example, the 
semantics is easy to express since the file in the outside world corresponds quite directly to a 
CE class, but more complex procedural interaction with the outside world will certainly be 
more difficult to express in CE's terms. This is an area for future research. 
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